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ABSTRACT 

The papers in this proceedings consider ways in which 
user friendly interface design may be used to make online systems in 
public and academic libraries easier to use, and thus more attractive 
to potential users. A short introduction by F. W. Lancaster is 
followed by nine papers: (1) "Linking the Unlinkable** (Michael 
Gorman); (2) *'Aristotle Meets Plato in the Library Catalog: Part 1** 
(Ward Shaw); (3) **Aristotle Meets Plato in the Library Catalog: Part 
2" (Ken Dowlin); (4) **Toward a Definition of User Friendliness: A 
Psychological Perspective** (Christine L. Borgman); (4) '^Is 'User 
Friendly' Really Possible in Library Automation?** (Dale K. Carrison); 
(5) **Ussr Interfaces for Online Library Catalogs** (Emily Gallup 
Fayen); (6) "Taming the Unfriendly System: Microcomputers as Patron 
Terminals to Access an Online Catalog" (Gary A. Golden); (7) "Natural 
Language User Interfaces in Information Retrieval** (tamas E. 
Doszkocs); (8) **Design Issues in Automatic Translation for Online 
Information Retrieval Systems" (David E. Toliver); and (9) **User 
Friendly Future: Applications of New Information Technology'* (Linda 
C. Smith). Seven of the papers include bibliographic references. 
(EW) 



************************** 

* Reproductions supplied by EDRS are the best that can be made * 

* from the original document. * 



******************* ******fr ********************************* ************ 



ERLC 



U.S. OCPAPVTMCNTOF i:OUCATK)N 
Otf»ce ot Educational Research and Improvement 

EDUCATIONAL RESOURCES INFORMATION 
CENTER (ERIC) 

yiThts document has been reproduced as 
received (rorr the person or organization 
oriQinating it 

O Minor changes have been made to improve 
reproduction quality. 



• Poi nts ol view or opi nions stated « n this docu' 
ment do not necessarily represent official 
OERi position or policy. 



What is User Friendly? 



Edited by 
F.W. LANCASTER 



"PERMISSION TO REPRODUCE THIS 
MATERIAL IN MICROFICHE ONLY 
HAS BEEN GRANTED BY 

H. Wagner 



TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC)." 



Graduate School of Library and Information Science 
University of Illinois at Urbana-Champaign 



Clinic on Library Applications 
of Data Processing: 1986 



7/ 



WHAT IS USER FRIENDLY? 



Papers presented at the 1986 Clinic on Library Applications 
of Data Processing, April 20-22, 1986 



© 1987 by The Board of Trustees of The University of IHinois 
ISBN 0-87845-072-6 ISSN 0069-4789 



CONTENTS 



Introduction 1 

F.W. LANCASTER 

Linking the JJnlinkable 2 

MICHAEL GORMAN 

Aristotle Meets Plato in the Library 

Catalog: Part 1 9 

WARD SHAW 

Aristotle Meets Plato in the Library 

Catalog: Part 2 15 

KEN DOWLIN 

Toward A Definition of User Friendliness: A 

Psychological Perspective 29 

CHRISTINE L BORGMAN 

Is "User Friendly" Really Possible in Library 

Automation? 45 

DALE K. GARRISON 

User Interfaces for Online Library Catalogs 52 

EMILY GALLUP FA YEN 

Taming the Unfriendly System: Microcomputers 

as Patron Terminals to Access an Online Catalog 61 

GARY A. GOLDEN 

Natural Language User Interfaces in 

Information Retrieval 80 

TAMAS E. DOSZKOCS 

Design Issues in Automatic Translation for 

Online Information Retrieval Systems 96 

DAVID E. TOLIVER 



ERIC 



6 



User Friendly Future: Applications of New 

Information Technology 108 

LINDA C. SMITH 

Index 119 



Introduction 



Considerable emphasis is now being given to the design or redesign of 
online systems in order to make them ''easier to use" and thus ''more 
attractive" to potential users. But do we really know what is meant by 
"easier to use" and "more attractive" in this context? These were the 
questions addressed at the Twenty-Third Annual Clinicon Librar> Appli- 
cations of Data Processing, held in the Levis Faculty Center, University of 
Illinois at Urbana-Champaign, on April 20-22, 1986. 

This volume contains the texts of the papers presented at the meeting. 
All of the authors explore the idea of "user friendly" as it applies toonline 
catalogs and related tools. Some of them summarize their own experiences 
in the implementation of online systems in academic and public libraries, 
some look at the broader psychological and social aspects of interaction 
between users and systems, and some attempt to predict what the future 
may hold for online bibliographic systems. 

This general overview of user friendly interface design should be of 
interest to all managers, systems analysis, consultants and other profes- 
sionals involved in the planning, development, and use of automated 
systems in libraries and information centers of all types. 

F.W. LANCASTER 
Edifor 
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Linking the Unlinkable 



The best advice I know on after-dinner speaking comes from a book of the 
Apocrypha: 

Let thy speech be short, comprehending much in a few wordi> (Ecclesias- 
ticus, 22:8). 

It was almost exactly ten years ago when I first spoke at a Clinic on 
Data Processing. At that time, I was merely a humble spear-carrier— a mere 
paper-giver. The keynote speech on that occasion was given by Frederick 
Kilgour. It was the first time that I had seen that eminent gentleman. He 
looked, then as now, more like the senior senator from Ohio than one of the 
leading innovators of modern librarianship. The years have rolled by and I 
find myself with the daunting task of following in the distinguished 
footsteps of the likes of Mr. Kilgour. Though my hair lacks the true 
senatorial silveriness which so distinguishes Fred, it has much more gray 
than it had in 1976. The amount of that graywhich is not due to heredity is 
due, in large part, to wrestling with the principles and practicalities of the 
online catalog (as we call it for convenience). It is the implications of one 
aspect of that automated bibliographic control system that I wish to 
discuss this evening. Specifically, the burden of my song is the idea of using 
microcomputers as the central component of a third way of achieving and 
extending developed online catalogs. (Incidentally, I must take full 
responsibility for the title of this keynote speech. My fondness for facetious 
titles [and, indeed, for facetiousness in general] has not dimmed with the 
years and I forced my waggishness upon my distinguished compatriot 
Professor Wilfrid Lancaster, who is hereby absolved of all responsibility.) 

I referred, a little while ago, to the fact that the term online catalog is 
now simply a term of convenience and one which is now so inaccurate as to 
be seriously misleading. (By the way, though I find the term unsatisfactory, 
I still prefer it to the horrid acronym OPAC [for "online public access 
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catalog'*]. Quite apart from theovcrtones of OPEQ there is the idea that an 
OPAC might l>e a political action committee which is dedicated to 
nothing— the zero-PAC.) The idea has never been that we should simply 
automate the pre-machine catalog (though, to tell the truth, some have 
tried to do just that), but that we should produce an online system which 
has at least three important differences from the prc-machine catalog. 

The first of these major differences is that the online system should be 
more responsive to the needs of the library user than is, say, the card catalog 
and will alJow many more ways of obtaining the information which is held 
in the system. This is readily achievable since even the worst computer 
systems are more responsive and forgiving than the card catalog ever was. 
Second, the online system should be more available to the user than its 
predecessors. By and large, we have achieved this second aim too by siting 
terminals in various locations in our libraries and communities, This has 
not been an invariable practice. Some libraries have been influenced by 
some ratherrum "studies'' of catalog use which have demonstrated conclu- 
sively that library users use card\.v,ialogs in places where those catalogs are 
sited. This clc'irly proves, to some, that terminals should be situated in the 
same place aj the card catalog. This zany logic leads to a loss of one of the 
great advantages of the online svstem. The third important difference, and 
the one with which I am primarily concerned this evening, Is that the 
online system \^ill contain far more information than its predecessors. 

In order to understand and examine this last point, we need to look at 
the situation which the users faced in using pre-machine systems. The 
fundamental problem was that the user's expectations were far higher than 
the capabilities of the bibliographic control system. He or she expected to 
be able to use the catalog to determine the availability of the materials 
sought. The catalog was not concerned with questions of availability but 
with questions of ownership. The user's question is, **Vbi est meumV 
(''Where is mine?"— Mike Royko*s proposed motto for the city of Chi- 
cago). The pre-machine catalog's dusty answer was 'The library owns, or 
believes it owns, this item." It has been amply demonstrated, in libraries 
and in the wider world, that, when answers do not match questions, a crisis 
of confidence results. The well-kept secret was, of course, that the informa- 
tion which was needed to answer the users' questions was scattered 
throughout numerous other files created and maintained by the library. 
Theon-order file, the binding file, the circulation file, the serial record, the 
serial check-in file... the 1 ist of these publ ic and private files was as extensive 
as it was dreary. Few librarians knew the ins and outs of all of these and 
almost all users were blissfully ignorant of their very existence. When I first 
came to the University oi Illinois Library, my then-assistant did a census of 
the paper files maintained in the technical services departments. They were 
more than sixty in number, of varying sizes and purposes. My favorite was 
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the **Dcad Slavic Serials File/* Surely the only thing on God's earth which 
is sadder than a dead Slavic serial is the memorial within which its demise 
is recorded! The task ofthecnlinereplacement forthepr/,-machinecaialog 
is to bring all this scaueied information together and to make it available 
to the library user. 

Since the beginning of computerized bibliographic systems in li- 
braries there has been a perception ihat there arc two ways of bringing all 
this previously scattered information together. To simplify, thediscussion 
has centered on the choice between integrated and separate systems with 
thr* smart money tending to favor the first. By an integrated system is meant 
one in which all the information about the materials held by or ordered by 
the library is stored and manipulated by an integrated set of programs 
within a single hardware configuration. Further, in an intc-grated system 
all this information is presented to the user at one terminal. On the other 
hand, separate systems would be those in which each function is carried 
out independently of c*ach other function. Some of these separate systems 
may even require separate central computers and separate and different 
terminals. This is an over-simplified picture because what has happened 
often in the real world has been that many libraries have created a hybrid of 
pariially integrated and partially separate systems. In this latter case, for 
example, the funcUons of rlie catalog and the circulation system might be 
integratc»d and the acquisitions and serial control functions miglu each be 
carried out by separate systems, 

Althougli tlie integrated system has been seen by most as tlie preferred 
alternative, the fact remains that few if any truly integrated systems have 
been achieved in medium-sized or large libraries. Even the partially inte- 
grated systems that have been achieved have been Ixrdeviled by the com- 
plexity of the software which is required to deal with a number of 
interrelated subsystems. Fitting the different data required for different 
functions into the Procrustean bed of the integrated system format has 
proved to be even more difficult. The concept of separate systems for 
separate functions has not been favored because it makes more work for the 
library user and l)ccause it is really no more than the automated version of 
the pre-machine system**. On the other hand, there are distinct advantages 
for the programmer and system specifier when it comes to creating a 
lailor-made system to carry out a specific function. Tlie clioice. then, has 
always seemed to be between the complex architecture of the integrated 
system nnd the user hostility of he separate system approach. There is. 
however, another possibility which may resolve the seemingly inescapable 
dilemma. That third way is made possible by the use of the 
microcomputer. 

Tliere is another dimension to this matter. It concerns the need for 
information other than the traditional bibliographic information found 
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in catalogs, order files, and (he like. That informaiion consists of informa- 
tion about serial articles (from indexes, abstracts, etc.), data in electronic 
form, and (though it is little more than a gleam in Fred Kilgour's eye) the 
full text of publications in electronic form. It is hare' wven to imagine the 
integrated system which would bring all this and the traditional kinds of 
bibliographic information together and even hgrder to realize such a 
system. It is almost depressing to think of the separate system concept 
applied in this area. Tlie thought of the library user being presented with 
twenty or more different terminals, each with its own commands and 
demands, is dismal indeed. Such an electronic Maginot Line woul<^ 
require staff resources which few libraries possess and would demand more 
application and effort from the user than any library has a right to expect. I 
wish I were as modern and progressive in these matters as Wilf Umcaster, 
but the fact remains that I still cling to the idea of the library more or less as 
we know it, to the notion that library service is intimately connected to the 
provision of information about printed materials (books, serials, etc.) 
well as to the more whiz-bangniaterials, and to the belief that new methods 
of communication su^jpiement rather than replace the older forms. 

Tl?e germ of tb.' Third Way— the alternative to both the integrated 
and separate system concepts— was born of the dilemma which we faced at 
the UIUC Library in combining a circulation system (LCS) and a MARC- 
based bibliographic system (WLN— Washington Library Network) to 
form our online catalog. For the moment all I need to mention is that we 
rejected both the idea of integrating the two systems (in any event a 
perilous and uncertain venture) and the idea of maintaining tx)th as 
separate systems (if for no other reason than this approach would have 
Ixrcn unfriendly, to say the least, to fhc library user). What we have done is 
to use miaocomputers (IBM PCs) as public terminals, to implant interface 
programs in those microcomputers which translate the user's natural 
language queries into i.ic arcane commands of the two systems, and to set 
up interactions between themiaocomputers and the mainframe computer 
which economize on telecommuniaition costs (in ihat the majority of the 
processing— and all the unproductive processing— is done in the micro- 
computer). Tliis is a small step for one library but one that is not without 
significance for library kind. The significance lies not in our local applica- 
tion but in the fact that two quite different systems are presented to the user 
as if they are one system. They have not been integrated but they do not 
stand alone. Tlie circle has been squared. Neither integrated nor separate, 
the systems are nevertheless in harmony w. .h the needs of the user. 
Remember also that these are completely different systems each with its 
own deep structure and each with its own economy and purpose. 

I would suggest that this modest beginning opens up important 
possibiliues for all online bibliographic systems and for the provision of 



ERIC 



12 



6 



MICHAEL GORMAN 



the kind of nonbibliographic information which I mentioned earlier. The 
essential point is that if, as we have demonstrated, one can design and write 
an interface program which links two completely different bibliographic 
systems then one could write such programs to link three, four, or five, or 
more such systems. In other words, the advantages of the separate system 
(that it is tailor-made for a particular function and performs its tasks with 
economy and efficiency) can be maintained in an environment which 
presents the user with the advantages of the integrated system (the bringing 
together and display of hitherto scattered and secret information). 

Having thus resolved the dilemma of integrated v, separate biblio- 
graphic systems, let us turn our attention to the nonbibliographic dfiiien- 
sion. This comprises three classes. The first is that of serial literature (what 
Dr. Ranganathan called "microthought"). We have traditionally given 
access to this kind of publication by means of printed indexing and 
abstracting services and latterly by online versions of such services. These 
services are inefficient, to say the least, because they are unorganized, 
random to a great degree, and because they are completely separate from 
the traditional bibliographic systems of the library. This is caused, in great 
part, by the fact that the indexing and abstracting services emanate from 
the for-profit sector. That sector is almost always philosophically and 
practically out of tune with the nonprofit sector which includes most 
libraries. The microcomputer, used intelligently, offers a way out of this 
problem too. If one can use a microcomputer to interact between two or 
mere incompatible bibliographic systems, then there is no reason why its 
use cculd not be extended to the interaction between bibliographic and 
indexing/abstracting systems. Those services could be either online or 
held as a local database using videodisc technology. 

Such an interaction of systems would go a long way, I believe, to 
refuting certain anomalous and erroneous findings of studies of early 
online catalogs. Those findings indicate that subject heading use increases 
dramatically when the move is made from pre-machine to online catalogs. 
It is my firm belief that this is a transitory phenomenon and that the 
increase in subject searches is partly due to the novelty of the online catalog 
and, in great part, to the fact that nothing better is available. I would 
predict confidently that, given easy and fre^ subject access to current serial 
literature online (as part of the microcomputer-coordinated total library 
system), subject searches for monograph literature would subside to the 
previous low level. The key words in the preceding sentence are "easy and 
free subject access." The question of making the access easy for the user (to 
conform to Mooers Law of Least Effort) is technical and relatively easily 
solved. The question of free access is one which is financial, strategic, and 
political. It involves the reconciliation of the for-profit and nonprofit 
sectors and can thus be regarded as, at very least, thorny. On the other hand, 

ERIC 



LINKING THE UNLINKABLE 



7 



if we are serious about using technology to move into a new dimension of 
library service, then I can see no better struggle upon which to embark. 

The second nonbibliographic class is that of data itself. There is, as 
has been pointed out often, an ever-growing mass of data available in 
machine-readable form. This data is not only available but is also, given 
the right programs, manipulate by the user. Again, there is no technical 
reason why such data and such programs could not be made available to 
the user, at the same terminal as the bibliographic and serial information, 
by the microcomputer controlled library system. This availability could be 
secured either to databases at remote locations or to locally held databases 
(again, perhaps using videodisc technology). 

Lastly, there is the question of the electronic publication (mono- 
graphic and serial in nature). Fred Kilgour (whose benign presence per- 
vades this paper) is currently engaged upon a research project called 
EIDOS which seeks to make the content of monographs in machine- 
readable form available to the user. This access will be primarily by 
"unconventional" means (searches of contents pages, captions, full text, 
etc.). Such techniques could be applied, together with more conventional 
access points, to serial publications in machine-readable form. When 
EIDOS is operational and when the volume and importance of electronic 
journals merits it, the microcomputer-controlled library system will reach 
out to engage these sources of information and knowledge and to bring 
them to the user. 

My message, then, is that the process of integrating and bringing to 
light the hitherto scattered information about library materials is most 
successfully achieved by miaocomputer coordination of separate and 
differing systems rather than by attempts at completely integrated library 
systems. Beyond this, that the quantum leap in service which has been the 
result of the creation of "online catalogs" will be matchedand exceeded by 
the next generation of library systems. Those systems will not only deal 
with bibliographic information but will also embrace the worlds of micro- 
thought, of data, and of publications in machine-readable form. All of this 
adds up neither to the demise of the library nor to the replacement of 
traditional means of communicating information and knowledge. On the 
contrary, it will lead to hitherto undreamed-of levels of enhancement of 
library service. Many years ago, Charles Ammi Cutter lamented the end of 
"the golden age of cataloguing." It is my firm belief that the library is on 
the threshold of a new Golden Age of bibliographic control and of provi- 
sion of nonbibliographic information, and that a prime tool in this renais- 
sance will be the humble microcomputer. 

Post scriptum: Since this paper was delivered on a Sunday and since it 
opens with a quotation from a book of the Jerusalem Bible, it seems fitting 
to record a Biblical quotation (for which I am indebted to Lowell Oxtoby 
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of Western Illinois University Library) on the topic of the importance of 
redundancy in computer systems: 

Two are better than one; because they hav^ a good reward for their 
labour. 

For if they fall, the one will lift up his fellow: but woe to him that is 
alone when he falleth; for he hath not another to help him up (Ecclesi- 
asies, 4:9-10). 

The autonomy and importance of the microcomputer in the systems 
which I envisage makes this exhortation of peculiar relevance. 
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Aristotle Meets Plato 
in the Library Catalog: Part 1 

This paper is part 1 of a presentation titled ''Aristotle Meets Plato in the 
Library Catalog." In it, I hope to set forth some aspects of the theoretical 
context, or point of view, from which we at the Colorado Alliance of 
Research Libraries (CARL) approach the design and implementation of 
what the organizers of this clinic have called "user friendly" systems, to 
desaibe a bit the organizational and systems setting within which we 
work, outline some of the design principles that guide our development, 
and provide a brief overview of the system as it exists today. Part 2, by Ken 
Dowlin, will discuss the system in an application context at the Pikes Peak 
Library District in Colorado Springs. The system in question is one 
developed by the Colorado Alliance of Research Libraries, and available 
for installation elsewhere through Eyring Research Institute, to whom we 
have granted a marketing license. It forms the basis for MAGGIE III, the 
system in Colorado Springs. 

The Theoretical Context 

You will be relieved to learn that this is not Philosophy 101 . However, 
as we try to address the question "What is user friendly?" it is important to 
uncover some basic assumptions that underlie our particular implementa- 
tion of a public system. Let us examine our theoretical context, with the 
understanding that all of it is emphatically arguable. First, a public 
catalog is an information system. Information is the name of a process; 
specifically, the process by which people become informed. The process by 
which people become informed is closely related to, or maybe the same as, 
learning. The name for sparking learning is teaching. Hence, an impor- 
tant characteristic of public systems is that they teach, and one measure of 
their utility is the effectiveness with which they teach. Teaching, as any 
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who follow debates surrounding educational policy will appreciate, is not 
well understood. One is led inevitably to the conclusion that we do not 
know what we are designing or at least that we do not have any guaranteed 
rules to follow. 

Aristotle was the champion of the a posteriori method. If he wished to 
learn about a i iangle, for example, he would analyze its parts and the 
mechanisms of their assembly by observation. He invented classification 
and, for all but the name, the scientific method. Plato, on the otherhand, 
concentrated on the a priori method of learning. If he wished to learn about 
a triangle, he would consider its ''triangleness" and draw logical conclu- 
sions from that concept. For him, the whole was both greater than and 
different from the sum of its parts. 

We have applied Aristotelean methodology with considerable skill 
and marvelous detail in the construction of our classification schemes, 
MARC records, analytics, authorities, etc. in the design of research li- 
braries and their traditional access tools. The method has served us remark- 
ably well in providing conceptual structures for managing and controlling 
enormous resources, and its use was dictated by the technologies available. 
The difficulty, of course, is that the tools we have constructed are complex 
in direct relation to the fineness of the analysis they represent and require 
of their users intimate knowledge of system structure as well as discipline 
structure. Divergence of the two structures is inevitable and extremely 
difficult to control. Part but not all of this difficulty is, to be sure, a 
function of the relatively inflexible (expensive) technology of their tradi- 
tional implementation. A large portion of the problem is that one must 
force one's thinking into the analytic patterns upon which the system is 
constructed, and it is thus exceedingly difficult to have new ideas. As 
McCluhan says, ''the medium resists" and mightily, 

Martin Heidegger, a twentieth-century phenomenologist, has written 
and spoken in detail about the concept of a tool, pointing out that a 
hammer, in the hands of a carpenter, is an extension of his arm. The 
carpenter uses the hammer to drive nails with wonderful efficiency and 
without thinking about it. While I do not have to know much about the 
hammer to pick it up, I must think about it in detail before I use it if only to 
avoid pounding my thumb. But Heidegger says that I am much more likely 
to conceive new uses for the hammer precisely because I see it as a tool for 
pounding rather than as a tool for driving nails. I look at its "hammer- 
ness," as might Plato, and draw a priori conclusions from the concept. In 
this case the medium also resists but potentially productively. And the 
more .general (or "platonic") the tool, the more productive the resistance 
might be. 

With electronic technology, the challenge is to enable users to manip- 
ulate our Aristotelian structures in Platonic forms, driving the systems to 
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explore what users conceive rather than what we have ''analyzed in." That 
we believe is the heart of ''user friendliness" and is the sense in which we 
offer the title of this presentation. It is the basic context from which we 
attempt design. 

The Organizational Context 

CARL is a private nonprofit corporation in Colorado that has as 
members the libraries of the University of Colorado at Boulder, the Univer- 
sity of Northern Colorado, the University of Denver, the Colorado School 
of Mines, as well as the Denver Public Library, and the Auraria Library 
which itself serves a consortium of three institutions of higher education in 
Denver. These are different kinds of organizations. They are state- 
supported, city-supported, and private. They are large geneial academic, 
large public, small special academic libraries. They differ in size from the 
University of Colorado — Boulder Library, a member of ARL; to ihe School 
of Mines Library serving a specialized academic clientele. They are also 
alike in certain important ways. They all have, as a part of their reason for 
being, the need to support graduate-level research, they all support large 
numbers of undergraduate students, and they all have a commitment of 
one kind or another to serve a wider user population than that of their 
immediate campus or city. 

Governance of CARL is via its Council of Members consisting of the 
directors of each of the member libraries. In addition, CARL has a board of 
directors (not to be confused with the library directors), but in practice 
policy is set by the council. 

CARL exists to create a single research resource for the various publics 
served by the member institutions. Said another way, CARL manages the 
collections of member institutions as if they were one co! lection. In order to 
accomplish this we have undertaken a whole series of network programs. 
The Colorado Organization for Library Acquisitions (COLA), for exam- 
ple, is a CARL program for cooperative acquisition of expensive material. 
It differs somewhat from other similar efforts in that the material pur- 
chased, although housed in the member libraries, is owned by CARL. We 
are developing a considerable collection enhancing those of the members. 
We also cooperatively purchase supplies and equipment for the members, 
when volume can generate savings. CARL's niajor program is the network 
online system. In order to create a single research resource, we needed one 
common mechanism to identify, locate, and control items throughout the 
network, and we also needed (and still need) a system for rapid, site- 
independent document delivery. 
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Design Principles 

In creating our r^ystem, we attend to several design principles which 
are derivable from the theoretical and organizational contexts just de- 
scribed. I haven't time, obviously, to discuss them in detail but will briefly 
outline a few of them to provide a flavor of our approach. 

First, the approach we use is heuristic, rejecting the algorithmic and 
simulation approaches as variously cumbersome, slow, and requiring 
impossible degrees of prior specification. As a result, our design principles 
are essentially statements of supposed value, and in some cases they are in 
direct conflict with each other. Each should be preceded by some substan- 
tial qualifier such as ''generally, in most cases, it is probably the case 
that....'* Negotiating between the principles requires constant trade-offs 
and modifications. Some principles regarding the overall system follow: 

— The system must make it easy for users to view the network as a whole. 

—The system must support local differences in both policy and practice. 

—The system must promote experimentation. 

—The system must provide very fast response time. 

—The system ought not to require the user to understand the structure of a 
bibliographic record or of its associated files but rather ought to pro- 
mote and support the construction of his own concept of organization. 
(We are indebted to Christine Borgman for alerting us to the idea of the 
user's ''conceptual map.") 

— The user must feel in control of the system, and not the other way 
around. 

—The system must adapt to the user's skill level. 

—The user should be able to get real results very quickly and then be able 

to experiment with variations very easily so that he may use the system to 

"explore." 

Some principles from a hardware/software point of view are: 

— Both hardware and software must be modular in design, allowing rela- 
tively easy changes to part of the system without dire consequences for 
the rest. 

— Pass constant values to software as data. 
—Separate message content and message form. 
—Keep data structures flexible. 
— Minimize disc accesses. 

And some principles relating to screen design: 

—Avoid library jargon and especially avoid computer jargon. 
—Keep screens uncluttered. 
—Avoid cuteness. 
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— Provide a cursor at the spot user typing will appear, and make that spot 

consistent from saten to screen. 
—Don't tell users they have done something wrong. Rather, let results 

speak for themselves and provide positive suggestions. Assume that 

users are in control. 
-—Don't use blinking fields or reverse video. 
— Systems have style. Keep it consistent. 
—Pay attention to layout as well as content. 

In summary, users know best what they do albeit sometimes with 
considerable professional help. System designers know best what the sys- 
tem can do. The goal of user friendliness is to provide a powerful, flexible, 
informative way for users to drive and control the system to their various 
ends. It is emphatically not to presume their ends or to channel their 
thinking according to predefined routes. 

The System Overview 

I'd like now to give you an overview of the CARL installation and the 
software (Pikes Peak Library District has a different configuration). The 
CARL hardware base is an eight-processor Tandem Nonstop II system. 
Each processor has 4 million bytes of main memory— 32 million for the 
current system. There are 6 billion bytes of disc memory for the files. In the 
six library sites, 390 terminals communicate with the system via various 
network communications equipment. 

Bibliographic records in the database come from all six institutions. 
From the system point of view, these records are organized in a common 
way and each field in each record contains an ownership bit map to 
indicate which institution "owns*' which field. From the user's point of 
view, however, the records are organized by institution— that is, the user 
searches and examines records one institution at a time. Early versions of 
the system required a cumbersome reentry of each search when switching 
from one institution's files to another's, and more recently we have made 
that switching extremely easy. Ultimately we v;ill support global searches. 
This progression was designed for political reasons. Individual institu- 
tions are wary of potential work loads on less heavily worked library 
subsystems such as interlibrary loans created by users from other institu- 
tions looking directly at their recc.ds. This fear has eased considerably 
with experience, partly because users who identify items they want at other 
institutions tend to go there directly rather than use traditional interlibrary 
methods to get the material. As these perceptions have ( hanged, the system 
has changed to reflect (lead?) new concepts. 
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The software is organized into four distinct modules. First, the back- 
ground software builds the database and creates the necessary indexing. 
Records are taken from OCLC, Autographies, and one or two other sources 
that members create as a result of their own cataloging activities. The 
software converts these records into our internal format and maintains the 
appropriate indexes. The various local fields are processed to create item 
records for circulation. The second software module is the public access 
catalog or PAC which provides searching of and switching between what- 
ever data are resident on the system. 

The third program module is the circulation system. This is a full- 
service system supporting charge, return, inquiry, holds, recall, tracers, 
cverdues, fines, lists, letters, reserves, conversion, statistical reports, and 
secured full edit control overall files and records. We interface directly with 
various academic computing centers for the transmission of accounting 
data generated by system activity. Of primary importance is that circula- 
tion status of items shows up instantly in PAC so that users have up-to-the- 
minute information about availability of items they discover. 

The fourth software module is bibliographic maintenance. Maint, as 
we call it, is used primarily for editorial changes to the MARC records. All 
fields are fully editable, and the program performs format checking and 
correction where appropriate to ensure MARC compatibility. Addition- 
ally, users can add and delete records. All changes are immediately pro- 
cessed and reflected in PAC and CIRC. 

The fifth module is acquisitions. Currently ready for beta test in one of 
the member libraries, it is scheduled for systemwide installation in the 
summer. The sixth module, serials control, is now in design. User access to 
these modules, as well as to Tandem or locally developed services, is 
available and secured through NEWPEX. 

The CARL database at the moment contains 1.85 million institution- 
unique bibliographic records and perhaps 3 million holdings. In addition 
to the 300 dedicated terminals, we provide free dial-up access to PAC, 
currently handling about 150 calls per day. We average about 1,200,000 
message transactions per day with an average response time of A seconds. 
By the end of 1986, we anticipate a database of 2.5 million records and 450 
dedicated terminals, generating 1.8 million daily transactions. Over 20,000 
people use the system on a typical day. 
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Aristotle Meets Plato in the 
Library Catalog: Part 2 



I will discuss the Colorado Alliance of Research Libraries (CARL) sybicm 
implementation as the basic housekeeping system at th^ Pikes Peak 
Library District (PPLD) on MAGGIE III and how PPjl.D has used the 
capabilities of the software in the CARL system to greatly expand and 
enhance MAGGIE'S PLACE (the computer system at PPLD). But first, a 
recap of the history of MAGGIE'S PLACE. 

MAGGIE'S PLACE 

The automation program at PPLD was started in 1975 and the first 
in-house computer, a Digital Equipment Corporation (DEC) PDP 11/70. 
was acquired in 1976. This computer was dubbed MAGGIE II with its 
namesake the long-time head of the technical services department, Mar- 
garet 0*Rourke. Over the next five years, through the hard work of the 
employees of the systems division of the library, MAGGIE'S PLACE 
became one of tlie most comprehensive and sophisticated librar/ auto- 
mated systems in existence. Program implementation started with a colkc- 
tion inventory system, then proceeded through circulation, acquisitions, 
serials, a public access catalog, and continued on to payroll, accounting, 
word processing, electronic snail, and other housekeeping tasks. PPLD 
broke new ground for the library world when community resource files 
were brought up on MAGGIE II in 1978. These files contain community 
agencies, clubs and organizations, adult education courses, an events 
calendar, and day-care centers. In 1981 PPLD added a second DEC compu- 
ter, a PDP li/44, to initiate the first community-wide public online 
CARPOOL system in the United States and later brought up a transit 
information system which provides the online schedules of the cuy bus 
system. Also in 1981 PPLD became the first library to allow owners of 
home microcomputers and business computers to link with MAGGIE for 
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searching the public access catalog and (he community resource files. By 
1983 the catalogs ai (he libraries of the U.S. Air Force Academy and (he 
University of Colorado ai Colorado Springs were online in addition to the 
eight facilities of PPLD. 

By 1983 MAGGIE had grown into a system with ninety-seven termi- 
nals and 1.8 billion byres of storage. Unfortunately, this Has alx)ve the 
maximum effective capacity for an 1 1/70 in the kind of activities required 
by PPLD. In 1983 the voters of PPLD approved a bond issue for a facility 
that would increase the total square footage of the district by 80 percent. 
This megabranch would require a minimum of fifty terminals— an impos- 
sible addition to MAGGIE IL Fortunately the bond issue included funds 
for an entirely new computer system. After th . passage of the lx}nd issue the 
specifications for the replacement system w.'e developed, a Request for 
Information (RFI) was issued, and based on responses from a number of 
potential vendors, a Request for Proposals (RFP) was distributed. 

MAGGIE III Specifications 

The RFP was a statement of the functions that were required on the 
system and the performance standards that were vjxpected. These perfor- 
mance standards weie based on the leading edge of development of existing 
hardware and software. A discussion of these standards follow. 

Reliability 

It is required that MAGGIE III be available as much as— if not more 
than—any other system on the market. It is expected that the system would 
be available over 99.9 percent of the time. 

Capacity 

The initial system must support 300 terminals operating within stated 
response time limits, and the architecture must allow growth to 1200 
terminals without making initial hardware and software obsolete. 

Expandability 

It must be possible to add additional devices in increments that pro- 
vide an even growth curve. In other words, it should be easy to add 
piocessor power or disk storage in increments with predictable costs; 
again, this must be done without making initial hardware or software 
obsolete. 

Speed 

The system is required to perform at an extremely high level of 
throughput. For example, the average charge-out of a \yook should be no 
more than two seconds when 300 terminals are on the system. 
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Mainsireamed 

The hardware, operating system, programming languages, and peri- 
pheral devices should be standard off-the-shelf products. Terminals 
should be available from a number of vendors and sI)ould be low priced. 

Vendor-Supported Housekeeping Programs 

In a major departure from past practice, the decision was made to seek 
software for circulation, for a public access catalog, for acquisitions, and 
for serials from a vendor, if the program met our needs and ongoing 
support was available. If these programs were not available, tlien PPLD 
staff would continue to develop tlie programs. 

Iri'house Enhancement 

PPLD wanted to retain the ability to develop applications in-house or 
to add packages from other vendors sucli as a financial package. A report 
and query language was required from the computer vendor tliat would 
allow PPLD staff to interact with the applications supplied by tlie vendor. 
The system should provide a database manager and other utility programs 
which decrease development time significantly. 

Technical Ability 

It was required tliat the system provi the best teclinical features of 
any system available on the market and tliai die arcliitecture for tlie system 
would be optimized for PPLD needs. 

**User Friendliness*' 

Even in 1983, the staff of PPLD— and tlie patrons— liad several years 
of experience with a public access catalog and had developed a number of 
definite prejudices. To make everyone happy, the system would liave to be 
accurate, fast, and powerful, but simple as well. Since PPLD liad over 3000 
users accessing the system from tlieir liome corriputers, ease of use took 
priority, PPLD staff traveled all over tlie country in order to evaluate tlie 
user friendliness of the proposed systems, 

CARL Expanded Ability 

None of tlie initial proposals met the PPLD specifications, and it was 
only when Eyring Researcli Institute, Inc. proposed a system using Tan- 
dem Computer Corporation hardware, tlie liousekeeping software from 
the Colorado Alliance of Research Libraries, that PPLD signed a contract, 
mindful of Eyring s expertise in installation, documentation, and train- 
ing. In addition, Eyring agreed to incorporate some clianges that were 
desired by PPLD staff. A detailed analysis of PPLD staff determined tliat 
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ihc CARL system could provide some capahiliiics beyond the other 
vendors. 

Database Manager 

The CARL Publ ic Access Catalog (P AC) could also ser\'e as a database 
manager. Tlie unique search routine using word, name, or browse is 
neutral as to content of the file. In other words, a file of clubs can be 
searched by word, name, or browse as easily as the catalog of books. Since 
the CARL system uses the MARC record, which allows variable-length 
fields within variable-length records, the PAC can be used to file almost 
anything. The system ise:isily explaitied to the average user by stating that 
it is as if every word and name on the catalog card is indexed, and evc<7 
IX)ssible combination of the words on the catalog aird can l>e used. Perhaps 
it should 1>Q lal>eled *'thc vacuum cleaner*' approach to indexing since the 
number of access points to a specific record isin the hundreds(one i)ossible 
way to calculate the number is tocount the words in the record and use that 
numlK*r factored). 

A Network Systerji 

The CARL Public Access Catalog was designed to allow the user to 
choose among the catalogs of all of the members of CARL, which facili- 
tates choosing not only among different types of librar>'*s cualogs but 
among files as well. It is anticipated that MAGGIE will Ix! connected 
directly to the CARL network, thereby providing access to over 80 |XJrceni 
of the f iiles in public and university libraries in Colorado. 

A Database Manager Supervisor 

Not only a\i\ the user of the CARL system switch among aualogs of 
different libraries, but the initial seiirch may also be repeated in each 
cualog automaticdly. It is anticipated that a global search of all files will 
soon Ix: possible. 

These expanded capabilities of the Eyring system fit the needs of 
PPLD extremely well. Since PPLD views its mission as one of community 
information center :>nd community communications center, as well as of 
traditional published materials center, it is essential to have a system with 
which to aeatc online database systems. The ability of the system to allow 
in-house design and creation of new databases in a nominal amount of 
lime places PPLD on the leading edge of agencies providing community 
information. A new file was designed and implemented, and data loading 
began in less than ten days by PPLD staff. This file, called ''KWIKREF' is 
for miscellaneous information developed by staff research that the librar- 
ians wish to retain indexed by every word and name. 
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MAGGIE III Implementation 

A contract was signed with Eyring Research Institute, Inc on 29 
March 1985; the hardware was delivered four days Sater and was installed in 
another two weeks. The CARL software was installed within another two 
weeks, and a circulation system and a public access catalog system were 
fully ff actional five months after contract signing. The hardware for 
MAGGIE III consists of four Tandem Non-Stop TXP 32-bit processors 
with four megabytes of memory each, four V8 diskdri ves with acipacity of 
3 bilHon bytes, a high-speed tajK drive, a high-speed printer, and the 
cabinetry, etc. for 300 terminals. The terminals purchased are primarily 
Lear Sigler ADM 12s that cost less than S600. Since the system is quite 
flexible on the selection of terminals, all of the old terminals from MAG- 
GIE II are usable on the PAC. Tlie system has exceeded performance 
specifications significantly. It appears tliat the system will handle more 
than £00 terminals and maintain the current res|>onse time . less than a 
second to charge a book. The system has Ijccn operational more than 99.9 
percent of the time in the first nine months, and the public is impressed 
with the ease of use of the Public Access dialog. 

PPLD is extremely pleased wi^^ the implementation of the system. It 
lias performcHl at levels exceeding sfx-cifications in all functions. The 
Community Resource Files that were present on MAGGIE II liave iK^en 
implemented on MAGGIE III with the significant improvement of con- 
sistent saeens, terminology, and search strategy among all files. In addi- 
tion, a catalogof documents created by local government agencies has iKcn 
implemented. Tlie time to develop a local database on the system appears 
to he only 10 |)erc<?nt of the time required on MAGGIE II. 

A Look at the PAC 

A look at the saeens of the PAC will illustrate the excellent user 
friendliness of the C/\RL PAC. It should be noted that the screens have 
iKen merged into exhibits in order to make the presentation more compact. 
Tlie text is as it appears; I have simply eliminated themajority of the blank 
lines on the CRT saeen. 

Exhibit I shows the initial screen seen on PAC terminals in libmry 
facilities and on home computers linked via dialup. The HELP choice u\ 
this point simply explains the contents of each database. 

Exhibit 2 shows the screen that appears after selecting number 1, the 
ON-LINE CATALOG. This screen explains the type of searches that are 
available. Next on the saeen is what appears after selecting W for Word 
search. Different examples are provided for each file in order to be more 
relevant to the user. The next se\'eral lines show the result of the user 
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(The computer screens have been merged in order to provide a 
savings in space. The text you see was downloaded directly into 
the wordprocessor from the p;«C) 



WELCOME TO THE COMPUTER CATALOG OF LIBRARY HOLDINGS 
(version 50) 

PIKES PEAK LIBRARY DISTRICT 

A project to the Eyring Research Institute and the 
Colorado Alliance of Research Libraries (CARL) 

PRESS (RETURN) TO START PROGRAM: 

PAC WORKING... 

Your first step is to select the LIBRARY whose catalog you wish to consult. 
Catalogs are currently available for; 

X. OU-LINE CATALOG 

2. CALENDAR 

3. AGENCY 

4. CLUB 

5. COURSES 

6. LOCAL DOCUMENTS 

7. KWIKREF 

8. KELP..* 

9. DAYCARE 



TYPE the NUMBER of the library you wish to search, and 
press tne (RETURN) )cey. 

ENTER NUMBER: 1 
WORKING.. . 
09/22/86 



Exhibit I 



entering the words transportation planning. The sequence of the words is 
irrelevant and the user may enter word stems if he or she is not sure of the 
complete word. For example, plan might have been used to expand the 
search to include plan and plans. The catalog contains 175 titles under the 
term transportation and ten titles under transportation + planning. At this 
stage the user asks for a list of those ten items. 

Exhibit 3 shows the list of the first seven hits on transportation 
planning. When the user selects one of the numbers the screen containing 
the record is displayed. This screen is the complete MARC record, with call 
number, facility location, and status. At the bottom of the screen the system 
provides the option of repeating the search on another database. By enter- 
ing S, the user may return to the screen to choose another database. When 
the user enters the number 7, the search is repeated on the LOCAL DOCU- 
MENTS database as shown on exhibits 4, 5, and 6. Because of the large 
number of documents in the database on transportation planning, it is 
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02:40 P.M. SELECTED CATALOG : ON-LINE CATALOG 

The computer can find books by NAME or by WORD 

NAMES can be authors, editors, or names c£ 
persons or institutions written about the book 

WORDS can be words from the title, or subjects, 
concepts, ideas, dates etc. 

You may also BROWSE by TITLE, CALL NUMBER, OR SERIES. 

Enter N for NAME search 

W for WORD search 

B to BROWSE by title, call number or scries 

S to STOP or SWITCH to another Library catalog 



There is also a quick search — type QS for details 

Type the letter for the kind of search you want, 
and end each line you type by pressing <RETURN> 
SELECTED CATALOG : ON-LINE CATALOG 



ENTER COMMANL ^ W 
WORKING... 



REMEMBER — WORDS can be words from the title, or can be subjects, 
concepts, ideas, dates, etc. 

for example — GONE WITH THE WIND 

SILVER MINING COLORADO 

BEHAVIOR MODIFICATION 
enter word or words (no mere than one line, please) 
separated by spaces and press <RETURIJ>. 

>TRANSPORTATION PLANNING 

WORKING... 
TRANSPORTATION 00174TITLES 
TRANSPORTATION + PLANNING OOOICTITLES 

For the OOOlO items that have 
TRANSPORTATION + PLANNING 

Press <RETURn>, or type <Q>UIT for a new search. 
WORKING. .. 



Exhibit 2 



necessary to add additional terms to narrow the search. The user can scan 
short entries for all items if he/she wishes. 

Exhibits 7 and 8 show the search transportation planning repeated on 
the Agency Database with the list of short entries in that file. The system 
provides an interesting information resource scanning facility, I doubt 
that most transportation planners would think of the Girls Club or day- 
care centers as entities involved in transportation, but the file shows that is 
certainly the case. 

" -28 
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PREPARING YOUR DISPLAY — HOLD 0N.». 

1 Meyer michael d PPLD see record 1984 
Urban transportation planning : a decision-orie 388*4068 H613u 

2 PPLD see record 1983 
Metropolitan transportation planning 388.4068 M594 

3 Foster mark s PPLD see record 1981 
Prom streetcar to superhighway : american city 771»70973 F756f 

4 PIKES PEAK PPLD see record 1979 
Notebook for first transportation planning worhc 711.7 P6369N 

5 PPLD see record 1976 
Out of cars, into transit : the urban transport 388»40973 094 

6 Citizens' goals colo PPLD see record 1976 
(citizens* goals background papers] 307»760978C581B 

7 GRAY,GEORG PPLD see record - nd 
Public transortation: planning, operations, and 388.4 G779P 

<RETURH> TO CONTINUE DISPLAY 

ENTER <LINe. NUMBER> TO DISPLAY FULL RECORD 

<Q>UIT FOR NEW SEARCH 3 
WORKING* 

AUTHOR(s) : Foster, Mark S* 

TITLE(s}: From streetcar to superhighway : American city planners 

and urban transportation, 1900-1940 / Mark S. Foster. 
Philadelphia : Temple University Press, cl981. 
xiv, 246 p* ; ill. ; 24 cm* 
Technology and urban growth 
Includes index* 
Bibliography: p* 235-237* 

OTHER ENTRIES: Urban transportation policy United States History* 
City planning United States History. 

CALL #: 771* '0973 F756f LOCN: PENROS 

STATUS: Not checked out — 

<RETURN> to continue, <0>UIT for a new search, or <R> to REPEAT this display 
Q 

You began with a W search on: 
TRANSPORTATION PLANNING 

Type S to try your search in another catalog, or 

R to repeat your search in ON-LINE CATALOG or 
<RETURN> for a new search 



Exhibits 
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Your initial search was: 

TRANSPORTATION PLANNING 
Select the catalog you wish to try next: 



1. 


ON-LINE CATALOG 


2. 


CALENDAR 


3. 


AGENCY 


4. 


CLUB 


5. 


COURSES 


6. 


LOCAL DOCUMENTS 


7. 


KWIKREF 


8. 


HELP... 


9. 


DAYCARE 



TYPE the NUMBER of the library you wish to search, and 
press the <RETURN> key. 

EUTER NUMBER: 6 

WORKING... 

this takes a sec... 

SELECTED CATALOG: LOCAL DOCUMENTSB 
TRANSPORTATION 01134TITLES 
TRANSPORTATION + PLANNING 00761TITLES 

For the (00761) items that have 
TRANSPORTATION + PLANNING 

Press <RETURN>, or type <Q>UIT for a new search. 
WORKING... 



TRANSPORTATION + PLANNING 00761TITLES 



You may make your search more specific (and reduce 
the size of the list) by adding another word 
to your search. The result will be items in 
your current list that also contain the new 
word. 

to ADD a new word, enter it, 
<D>ISPLAY to see the current list, or 
<Q>UIT for a new search: 

NEW WORD(S): COLORADO 
WORKING... 

TRANSPORTATION + PLANNING + COLORADO 00137TITLES 

You now have: TRANSPORTATION + PLANNING + COLORADO 00137TITLES 



Exhibit 4 
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you may make your search more specific (and reduce 
the size of the list) by adding another word 
to your search* The result will be items in 
your current list that also contain the new 
word* 

to ADD a new word, enter it« 
<D>ISPLAy to see the current list, or 
<Q>UIT for a new search: 

NEW WORD(S) : SPRINGS 
WORKING* * * 

TRANSPORTATION PLANNIUG * COLORADO ■*- SPRINGS 00081 TITLES 



You now have: TRANSPORTATION + PLANNING + COLORADO + SPRINGS 00081TITLES 

you may make your search more specific (and reduce 
the size of the list) by adding another word 
to your search* The result will be items in 
your current list that also contain the new 
word* 

to ADD a new word, enter it, 
<D>XSPLAy to see the current list, or 
<Q>UIT for a new search: 



NEW WORD(S): HIGHWAYS 
WORKING*** 

TRANSPORTATION PLANNING -i^ COLORADO * SPRINGS + HIGHWAYS 00009TITLES 

You now have: TRANSPORATION * PLANNING * COLORADO -f^ SPRINGS * HIGHWAYS 
00009TITLES 

You may make your search more specific (and reduce 
the size of the list) by adding another word 
to your search* The result will be items in 
your current list that also contain the new 
word* 

to ADD a new word, enter it, 
<D>ISPLAY to see the current list, or 
<Q>UIT for a new search; 

NEW WORD(S) : D 



Exhibit 5 



The Future of MAGGIE'S PLACE 

There are several new databases on the drawing boards and, based on 
the implementation of the current databases, it appears that new ones will 
be created on a regular and frequent schedule. Plans may change depend- 
ing on circumstances, but at present the several databases are planned. 

Facts 

Pierian Press has provided a demonstration tape of the data contai-..d 
in its serial entitled ^4 Matter of Fact, It appears that loading this data into 
the PAC will be relatively routine, Pierian Press will send out updated 
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PREPARING YOUR DISPLAY — HOLD OH... 

1 Agency Colorado depa PPLD PENROS LOHIST - nd 
Title: U.S. highway 24 bypass Colorado springs, CRD01-HWY/EI-T83 (1976) 

2 Agency pikes peak ar PPLD PEHROS LOHIST - nd 
Title: study of access routes to peterson field REGtPPACG/SP-P37(1967) 

3 Agency planning divi PPLD PENROS LOHIST - nd 
Title: transportation plan, city of Colorado spr CS:CD-PL/SP-T61(1986) 

4 Agency ridcfinders t V>PLD PENROS LOHIST - nd 
Title: a proposal for alternative transportation 0RG'»-RI0ry5P-G16(1984) 

5 Agency department of PPLD PENROS LOHIST - nd 
Title: traffic and preliminary engineering study CRD0:HWY/SP-G16(1986) 

6 Agency pikes peak ar PPLD PEHROS LOHIST - nd 
Title: Colorado springs long-range plan update s REG:PPACG/SP-T851 (1984) 0 

7 Agency Colorado depa PPLD PENROS LOHIST - nd 
Title: widefield, el paso county; draft environm CRDO+-HWY/EI-W42(1971) 

<RETURN> rO CONTINUE DISPLAY 

ENTER <L1HE NUMBER> TO DISPLAY FULL RECORD 
<0>UIT FOR HEW SEARCH 7 

WORKING... 

AUTHOR(S): agency: COLORADO DEPARTMENT OF HIGHWAYS 

TITLE(s): title: widefield, EL PASO COUNTY; DRAFT ENVIRONMENTAL 

IMPACT STATEMENT ADMINISTRATIVE ACTION 

PUB date: SEPTEMBER 30, 1971 

abstract: this statement discusses the proposed route for 
the extension of state highway 16. the extension discussed 
continues highway 16 easterly to intersect the extension of 
marksheppel road* both routes have been accepted as part of 
the colorado sprimos metropo: tan area transportation study 
for the colorado springs areh. 

OTHER entries: KEY WORDS: TRAFFIC - PLANNING; HIGHWAYS; SOCIO-ECONOMIC 
ANALYSES; MASS TRAHSPOR'^ATION; CRDOf HWY/EI-W42 ( 1971) 
DOC T'/PF.: ENVIRONMENTAL IMPACT STATEMENT 
GEOG area: widefield 

features: maps, PHOTOGRAPHS, DIAGRAMS 
GOVT level: state 
xdohor: XPPACGLIBRARY 



call #: CRDOfHWY/EI-T83(l976) 



library: 



PENROS LOHIST 



Exhibit 6 



tapes as a subscription with fixed prices. By the time this article appears in 
print, the users of PPLD will be routinely searching the data contained in 
the 'TACTS" database by any word or name. It should be very popular 
with home users since there will be rens of thousands of facts with citations 
of sources available in their own homes. 

Reviews 

Thn editor of Pierian Press has indicated that they would like to create 
a database consisting of most of the book reviews contained in magazines 



ERLC 



32 



26 



KEN DOWLIN 



<RETURN> to continue, <Q>UIT for a new search* or <R> to REPEAT 

this display 

Q 

You began with a W search on: 



TRANSPORTATION PLANNING 

Type S to try your search in another catalog, or 

R to repeat your search in LOCAL DOCUMENTS or 
<Rr,TURN> for a new searchrS 

Your initial search was: 

TRANSPORTATION PLANNING 

Select the catalog you wish to try next: 

1* ON-LINE CATALOG 

2« CALENDAR 

3. AGENCY 

4* CLUB 

S. COURSES 

6* LOCAL DOCUMENTS 

?• KWIKREF 

8* HELP*** 

9. .^AYCARE 

TYPE the NUMBER of the library you wish to search, and 
press the <RETURN> key. 

ENTER NUMBER: 3 

WORKING*** 

this takes a sec*** 

SELECTED CATALOG: AGENCY 

TRANSPORTATION 00027TITLES 

TRANSPORTATION + PLANNING 00002TITLES 



PREPARING YOUR DISPLAY HOLD ON*** 



1 Pikes peak area council of governments 



PPLD 

AGENCY FILE 



2 Downtown Colorado springs, inc* 



PPLD 

AGENCY FILE 



ALL ITEKS HAVE BEEN DISPLAYED* 

ENTER <LINE NUMBER> TO DISPLAY FULL RECORD 

<Q>UIT FOR NEW SEARCH 1 
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WORKING. 

TXTLE(S): PIKES PEAK AREA COUNCIL OF GOVERNMENTS 

ADDRESS: 27 E. VEKHIJO, 5TH FLOOR 
CITY, ST, ZIP: COLORADO SPRINGS, CO 8O903 
HOURS: 8-5 M-r 

telephone: 471-7080 
parent org: ppacg 
director: david l. peterson 
contact: dian sukalski 

other entries: function*. regional planning agency for el paso, park and 

TELLER counties; REGIONAL CLEARINGHOUSE FOR FEDERAL 
FINANCIAL ASSISTANCE (A-95 REVIEW); OFFICIAL CENSUS DATA 
USER CENTER PROVIDING CENSUS DATA, HAPS, STATISTICS, ON 
HOUSING AND POPULATION; PROVIDE PLANNING SERVICES IN 
TRANSPORTATION, AGING, ENVIRONMENTAL QUALITY, HOUSING, 
STATISTICS/ECONOMIC FORECASTS, ETC; MAINTAIN EXTENSIVE MAP 
LIBRARY AND PLANNING LIBRARY. HANDICAPPED ACCESS. 

ELIGIBILITY: LOCAL GOVERNMENTS, PRIVATE ACCESS. 

APPLICATION: PHONE, WALK IN 

KEYWORDS: PLANNING GOVERNMENT LOCAL 

DATE ENTERED: 06-Mar-8l 
DATE UPDATED: 08-Apr-86 

AGENCY: PIKES PEAK AREA COUNCIL OF GOVERNMENTS 
CALL «: AGENCY FILE LIBRARY: 



<RETURN> to continue, <Q>UIT for a new search, or <R> to REPEAT this 
display 

You began with a W search on: 
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throughout the United States. They would supply monthly update tapes of 
PPLD so that current reviews would be available in the PAC. It would be 
an extremely valuable source for librarians involved in book selection, and 
the public would be able to view a number of reviews of a book prior to 
selecting its reading material. 



Voter Information 

The Pikes Peak area has many representative government divisions in 
which the citizens need to participate through elections. There is a ple- 
thora of districts as wel 1 as the cities and the county. El Paso County alone 
has twenty-six school districts. The polling places for elections can be 
difficult to find. 



City Code 

The city attorney for Colorado Springs has asked PPLD to assess the 
feasibility of providing dial-up access to the city legal code. This code is 
modified practically every time the city council meets and it is very expen- 
sive to reprint the code every two weeks. 
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LINK 

There are many people willing to teach those who want individual- 
ized instruction on almost any topic. The LINK file would simply be a 
directory of people and their talents. 

Conclusion 

User friendliness must move beyond traditional, esoteric cataloging 
practice as embodied in the card catalog. It means simple commands that 
are very powerful, operating on multiple databases that are relevant to the 
local user community. There is little question at PPLD that the methods of 
Plato and Aristotle exist side by side in the online catalog. MAGGIE'S 
PLACE has entered a new era. 
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Graduate School of Library and Information Science 
University of California— Los Angeles 

Toward a Definition of User Friendly: A 
Psychological Perspective 

Introduction 

"User friendly*' is one of those valuable concepts that has become such an 
overworked phrase that it has lost much of its meaning. As Meads notes, 
"[t]he forced grin of user friendliness becomes a mask for lack of capability, 
insufficient performance, costly maintenance, or a collection of mis-fitting 
components."^ User friendly is not merely the addition of high tech hard- 
ware such as a mouse, icons, or three-dimensional graphics. 

What does user friendly mean? First, consider a dictionary definition. 
Webster's defines user simply as "one who uses.'*^ Friendly is defined as 
"of, relating to, or befitting a friend: as a: showing kindly interest and 
goodwill b: not hostile c: inclined to favor d: comforting, cheerful/*^ We 
infer that "user friendly" suggests an entity that is warm and comforting to 
the one who uses it. 

Matthews and Williams defined a "user friendly index" for informa- 
tion systems as a nine-point scale, ranging from "user intimate" at the top 
to "user vicious" at the bottom, with "user oriented" as a midpoint.^ They 
have followed the same line as Webster's, considering "user friendly" as 
being kind— or at least the inverse of hostile— to the one who uses. 

Meads takes the definition of user friendly further by stating three 
requirements. The first is that the system is cooperative^-ii provides active 
assistance during the task and makes its actions clear and obvious. Second, 
the user friendly system is preventive--ii acknowledges that people make 
mistakes by preventing those mistakes to the extent possible and by provid- 
ing backout and recovery procedures. Third, the friendly system is 
conducive --ii is reliable, predictable, and assists rather than controls the 
user.® 

Meads's three requirements can be combined into the one attribute 
transparency, a commonly used term from computer science. If a system is 
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transparent to the user, it means that the user is looking through the system 
to the task being accomplished and not focusing on the system itself. A 
transparent system is one that supports and simolifies a task rather than 
becoming a task in and of itself. This paper will discuss current research on 
information system;, that has the implicit goal of ma'ung systems more 
user friendly and that is being conducted from a psychological perspective. 

The Human as a Unit of Analysis 

The interaction of humans with computers can be studied at multiple 
levels of analysis. Here we are concerned with the psychology of the user, 
which is roughly a mid level unit of analysis. By psychology we mean the 
study of human behavior— i.e., mental and behavioral characteristics as 
they apply to the use of computers. The research done in the area is largely 
based on the theories of cognitive psychology. Studies areol the individual 
user as representing the larger body of users. 

Kuman-computer interaction can be studied at both lower and higher 
levels of analysis than that of the individual user. At a lower level would be 
the human factors studies that focus on anthropomorphic dimensions of 
the human: fitting the keyboard size and layout to the average human 
hand, designing workstations with the proper dimensions for human 
comfort, screen displays that minimize glare and eyestrain, and so on. 

At a higher level of analysis than the individual is the study of the 
organization or the social group response to the use of computers. The way 
in which people use computers is affected by the way in which the systems 
are introduced, their motivation to use them, the training provided, the 
threats to the current job, and changes in task and work structure. 

All of these levels must be studied to provide a full picture of human 
use of computers and hence of friendliness. However, they cannot all be 
studied at once. In this paper we confine ourselves to the study of the 
individual user. 

A Psychological Perspective 

Researchers in academic departments of psychology, communication, 
computer science, and library and information science, as well as indus- 
trial researchers, have been applying both psychological theory and 
method to the study of human interaction with computers. In addition, 
psychologists have used the study of human behavior with interactive 
systems as a test-bed for developing theory and method. 

The remainder of the paper will cover two distinct bodies of research. 
First we cover psychological theories that have been applied directly to 
interactive computer systems. Some theories already have been applied to 
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information systems; others are better proven elsewhere but havcpoieniial 
for use in tliis domain. 

The second body of research to be addressed is studies done to charac- 
terize behavior on information retrieval systems— both online catalogs and 
bibliographic retrieval systems— that is not driven by theory. Rather, it is 
pretheoretical, gathering data that may lead to theory development later. 
This body of research utilizes research methods drawn from psychology 
and other social sciences. We will focus specifically on studies of error 
behavior because errors interfere with usage and hence with transparency. 

APPLICATIONS OF PSYCHOLOGICAL THEORY 

Three theories will be considered here, each of which is general and 
has been applied to other information technologies. The first is that of 
mental models, an attempt to describe the learning and problem-solving 
processes involved in the use of computer systems. Second is that of 
information processing models, an attempt to build discrete quantitative 
models of interactive behavior. The third theory considered is individual 
differences, an attempt to explain variance inperformanceand interaction 
style by personality and demographic characteristics. 

Mental Models 

The mental models theory, drawn from cognitive psychology, is per- 
liaps the most appealing iheory for the study of human behavior on 
information systems.^ Although it ha£ not yet been applied widely to 
retrieval systems, the research to date holds considerable promise for both 
design and training. 

Psychological Research on Mental Models 

Research in learning theory in various contexts has shown that people 
tend to build hypotheses as part of problem solving. When a person 
approaches a new task, whether it's fixinga toaster or a cirburetor, solving 
a math problem, or learning a text editor, he or she tends to gather 
information from the context of the task. The information might be drawn 
from a manual, from watching other people, from prior knowledge, or 
from the response of the problem to the user's actions. As the user/problem 
solver takes an action — such as turning a screw, writing an equation, or 
entering a command— the problem changes and the resuh is observed. 
From all of these sources the user makes further hypotheses about how the 
entity or problem works and about why it is responding in a particular 
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way. Evidence from actions is taken as supporting or negating the hypo- 
•heses made and the hypotheses are refined accordingly with ihe user 
taking more actions until the task is completed or abandoned. 

All these hypotheses and actions fit together into a "mental model" of 
how the entity works. The mental model starts out tuzzy and becomes more 
clearly defined with experience. It is important to note that the user/prob- 
lem solver is not neccs>?nly aware that he or she has or is applying such a 
model. The model isp» ft of the problem-solving process and usually is not 
a conscious effort. 

The ability to de* < lop a mental model is a valuable intellectual skill 
and one that is very helpful when the information applied to the problem 
is correct and when the hypotheses are correctly interpreted or revised. 
Unfortunately this is not always the case. A person may not gather enough 
information about the task first (read the instructions, assess the nature of 
the problem)^ or he or she may start with incorrect assumptions such as 
that it works like some other entity previously seen or that the problem is 
something other than it actually is. For example, people often assume that 
a text editor works much more like a typewriter than it actually docs or that 
an online catalog is more like a card catalog than is actually the case. To 
complicate life further, people often interpret the results of their actions as 
supporting their hypotheses whether or not they do indeed.^ 

The theory suggests that people can be trained with a conceptual 
model of the system from which they can draw a mental model that is 
compatible with their own thinking processes. The research design typi- 
cally applied in mental models studies is to assign subjects to two groups, 
one trained with a conceptual model of the system and one trained with a 
procedural set of instructions (no framework; just "first do this, then do 
this...."). Tlie underlying hypothesis is that those trained with a concep- 
tual model will develop a mental model and will perform better on the 
tasks, and those trained on!v w 'ih procedures either will develop an incor- 
rect model or will not deveVop a model at all. 

A further hypothesis is that having a menial model is not as important 
for the simple tasks that can be accomplished with one or two predefined 
procedures as it is for more complex tasks that involve multiple procedures 
or extrapolation from basic procedures.® 

Applications to Information Systems 

The first study to test the mental models theory on retrieval systems 
compared the two training methods on ? Boolean-logic-based online 
catalog of OCLC records.^ As predicted, it was found that on simple tasks 
there was no difference in performance (number of items correct) based on 
training, but on complex tasks, those trained with a conceptual model of 
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the system got inore items correct and exhibited different patterns of 
interaction with the system than those trained procedurally. 

The only other study of mei tal models and informa^.ion systems 
identified to date is a master s thesis from the University of Chicago done 
by Jean Dickson.*^ Her study was not experimental; rather, she attempted 
to infer a mental model from the monitoring record of user behavior on 
NOTIS, the online catalog at Northwestern University. Dickson looked 
specifically at the errors in author and title searches that resulted in no hits 
and concluded that users applied different mental models from those 
applied to a card catalog because they searched differently* Her most 
striking examples wtre the frequency of enors due to entering authors with 
given name first (12.6 peiceni of no-hit author searches) and due to the 
inclusion of initial articles in title searches (IC.l pf rcciit of no-match title 
searches), neither of which would be appropriate behavior in a card 
catalog. Other explanations exist for these behaviors, but the data do 
suggest that users make incorrect hypotheses about the system. 

Information Processing Models 

Psychological Research on Information Processing Models 

An information processing model is an attempt to break down human 
tasks into discrete physical and cognitive actions and to c^ssign probabili- 
ties of occurrence and performance times to these actions. The model 
allows task behavior to be calculated and predicted. The computed perfor- 
mance times and patterns can be used to compare methods of performinga 
given task. The best known of these models are the GOMS (Goals, Opera- 
tors, Methods, and Selection rules) and keystroke models of Card, Monm, 
and Newell." The GOMS model predicts human behavior on a specific 
task in terms of the user's goals, operators, methods, and selection rules. 
The model was developed using manuscript editing tasks. In this context. 
Card and his colleagues have achieved roughly 90 percent accuracy in 
predicting behavior sequences and 33 percent accuracy in predicung time 
required for modifications. 

The keystroke model is more discrete and predicts time to pt^fonn a 
given task as a linear sum of four physical and one mental operators. In 
text-editing tests. Card's research team modeled behavior with a 21 percent 
error rate. 

These models are useful for comparing features for implementation in 
designing a system. They have been used for comparisons such as deter- 
mining whether a control character sequence is better for an editing 
function than a function key or whether a mouse is better than a joystick 
for pointing to objects on a screen.*^ 
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The infomiation processing models on a task are built by training 
people until they are expert which may take thousands of repetitions. It has 
been reasonably successful in developing text editing systems which are 
well-suited to expert, highly repetitive behavior. The models also are being 
used to advance the information processing theories of cognitive 
psychology. 

Applications to Information Systems 

The information processing models have not yet been applied to the 
design of information retrieval systems. They may be helpful for determin- 
ing the best use of command sequences in terms of making frequently used 
actions most accessible, minimizing confusion among actions, and so on. 

Overall, the information processing models are less applicable to 
information retrieval systems than to text editing because the task does not 
lend itself as well to expert behavior. The information retrieval task is 
much less clearly defined, requiring heuristic thinking and continual 
reevaluation of the task. Further, few users of information retrieval systems 
use them in a production, expert mode. The vast majority use the systems 
too infrequently to achieve the expert behavior on which the information 
processing models are based. 

Individual Differences 

Psychological Research in Individual Differences 

Most of the psychological research on human interaction with inter- 
active systems comes from the area of cognitive psychology which is based 
on the "information processing model" paradigm alluded to earlier. The 
theory which underlies much of current cognitive research attempts to 
reduce human behavior to information inputs, processes, and outputs. 
The intent is to identify fundamental characteristics across all people that 
can be used to predict behavior. The information processing theorists do 
not acknowledge differences among people. Rather, they treat such differ- 
ences as "random variance." 

Another branch of psychology is specifically interested in that "ran- 
dom variance." Those in the area of "correlational" or "differential" 
psychology look for variance in behavior that occurs naturally and then 
seek factors that differentiate among ini'ividuals or groups. Their intent is 
to identify causal, or at least associative, relationships after the fact. 

The differential psychology researchers have determined that some 
people have an easier ti ae using informarion technologies than others- 
including information retrieval systems, text editors, and programming 
languages. Once the fact has be:n established that a range of behavior 
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exists, the method is to analyze the behavior of a group of people on the 
task, capturing data on as many related factors as are hypothesized to be 
responsible for the differences. 

In text editing studies, researchers have found that age and spatial 
memory are important factors. Those who are younger and who have the 
best spatial memory capabilities perform best on text editors. 

Similarly, researchers have found consistent variance in those who are 
professional programmers, finding that they fall into a consistent style of 
processing — more thinking than feeling, more intuitive than sensing. 
Those who perform best in introductory programming courses also take 
more science and math courses, score better on general achievement tests 
(math and verbal), and get higher grades.^^ 

Applications to Information Systems 

Studies of user behavior on both bibliographic retrieval systems and 
online catalogs longhave found wide variance in usage patterns even when 
the same system and database are used.^® In summarizing the characteris- 
tics of the "average" search across multiple studies, Fenichel reports broad 
ranges in reported means for variables such as number of descriptors 
searched, commands used, connect time, retrieved references, recall, preci- 
sion, and unit cost.^^ Only recently have researchers begun to identify 
systematically the sources of some of the variance observed. 

Amount of experience with the system is the variable most commonly 
studied in identifying performance differences. Fenichel was able to deter- 
mine only that novices (low database experience and low searching expe- 
rience) searched more slowly and made more errors than experienced 
searchers. 

Penniman, in monitoringstudies, found that frequent searchers of the 
NLM Medline system used about the same number of single terms and 
displays in a search as did infrequent searchers but twice as many advanced 
te;m search entries and half again as many Boolean searches. Moderately 
frequent searchers used more of all types of commands than infrequent 
users. 

Three dissertations have explored the personality differences that may 
underlie searching perfoi nance on bibliographic retrieval systems. Brin- 
die studied the relationship between cognitive style and search perfor- 
mance in a field experiment but found few significant differences.^^ 
Bellardo studied graduate library school students who had just completed 
a course in online searching, testing them on two measures of creativity 
and one measure of personality and obtained their Graduate Record Exam 
(GRE) scores. Bellardo attemptt.d to correlate these measures with search 
performance (precision and rea.ll) but was unable to explain much of the 
variance. However, she did find a significant (p < .05) correlation between 
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search performance and GRE quantitative scores but no correL.ion with 
GRE verbal scores."^ 

In a field experiment, Woelfl tested skilled NLM Medline searchers on 
inductive and deductive reasoning and learning style. Woelfl found that 
searchers clustered strongly in one learning style (high active, high 
abstract). Overall, the cognitive attributes affected the search process but 
not search results.^ 

As with other types of information retrieval systems, we find a wide 
range in skills among online catalog users. Monitoring studies have identi- 
fied high variance in the types of searches performed, in the length of 
searches, and in the patterns of errors.^ Each of these were unobtrusive 
field studies and did not collect any data on individual users that could be 
compared to the search pattern data. Survey data of the same population 
found a comparable range of user-reported success and satisfaction levels 
in system use and a broad range of user background characteristics.^^ 

Borgman found significant differences in the ability to pass a bench- 
mark test of information retrieval skills by academic major. Those who 
failed the test were predominantly social science and humanities majors 
while those passing the test were science and engineering majors (p < 
0.0001). Prior computer experience was controlled (subjects had no infor- 
mation retrieval experience and at most two programming courses).^ 

Based on the earlier discussed results, Borgman is pursuing the 
hypothesis that academic major is a gross measure of individual differences 
and is probably a surrogate for other characteristics that are associated with 
major. ® Preliminary results of a study incorporatingpersonality tests used 
by Woelfl and demographic characteristics identified in studies of pro- 
gramming aptitude indicate that engineering majors cluster strongly 
around personality characteristics asso:iated with both information re- 
trieval and programming, while English and psychology majors show 
either no pattern or one opposite that of engineering majors. 



ERROR BEHAVIOR 

The study of error behavior is crucial to the issues of system transpar- 
ency. If a system is transparent, it will support and simplify a task— not 
become a task in itself— and be congruent with the user's thinking style 
and workflow. The difficulty is in measuring these indicators of transpar- 
ency. We find usually that it is easier to gather evidence on when a system is 
not working well than on when it is. Thus, we study user errors and 
problems. 

User errors and problems with information retrieval systems can be 
divided into two categories: those encountered with the mechanical aspects 
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of searching (typos, incorrect commands, etc.) and those with the concep- 
tual aspects (controlling the interaction, achieving useful results, etc.). 

By identifying errors in the mechanical aspects, we can identify poorly 
engineered system factors that may be increasing the likelihood of certain 
types of errors. Identifying the most common errors can lead to isolating 
nonintuitive command sequences, misleading displays, and other 
unfriendly aspects of a system. 

Similarly, by identifying poor levels of searching performance (low 
recall and precision, inefficient use of commands, etc.), we can determine 
ways in which the system interferes with the natural flow of problem 
solving (retrieving information) and the points at which it fails to be 
congruent with thinking style and workflow. It also allows us to identify 
misconceptions about the systems thereby understanding better how peo- 
ple are interpreting system actions and internalizing them into their 
behavior. With such knowledge both the design of systems and training for 
ihem can be improved. 

The causes of the errors and problems identified by studying user 
behavior can only be inferred, of course. But the evidence will result in 
hypotheses about the sources of the behavior that can be taken to the 
laboratory for further study. 

The discussion here is intended to provide only an introduction to the 
kinds of studies that can be done to identify user problems with systems. 
For a fuller discussion of these results and their implications, the reader is 
referred to Borgman^^ (the applications of psychological theory are dis- 
cussed at length in another paper by Borgman^^). 

Problems with Mechanical Aspects of Searching 

Bibliographic Retrieval Systems 

Problems with the mechanical aspects of searching have not proven to 
be a major barrier to the use of bibliographic retrieval systems, although 
several studies have found that they are a barrier for very inexperienced and 
infrequent users.^ Fenichel, in an experiment capturing printed search 
protocols, found that both moderately experienced and very experienced 
searchers made significantly fewer nontypographical errors per search 
than did novices although the overall number of errors was small (2.8 per 
search for novices).^^ 

Defining errors only as erasures, Penniman found an average of 8 
percent of user actions as errors.^^ Tolle and Hah, using the same defini- 
tion in a monitoring study of the NLM CATLINE database, also found an 
average error rate of 8 percent.^ 
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Online Catalogs 

Mechanical problems have been particularly evident in monitoring 
studies of onltue catalogs. ToUe found that errors were not isolated.^^ 
Instead they tended to occur in clusters; once an error was made the next 
transaction v/as likely to be an error as well. In the SCORPIO system of the 
Library of Congress, given that an error was made, the likelihood that the 
next command was an error was 59.8 percent; for the SULIRS system - 
Syracuse University, it was 28.6 percent; for the LCS system at the Ohio 
State University it was 33.3 percent. Errors were defined in SCORPIO as 
unrecognizable search commands; in SULIRS as an unrecognizable com- 
mand, an incorrectly formatted command, or an invalid item number; in 
LCS as partially or fully unrecognizable commands. Data from these 
studies also indicate that users tend to quit immediately after receiving an 
error message. 

In a monitoring study of the Ohio State University (LCS) online 
catalog, Borgman defined two types of errors: logical errors or commands 
that could be partially recognized by the system and typing errors or 
commands that could not be recognized at all. Errors were roughly equally 
divided between the two types. Total errors averaged 13.3 percent of all user 
commands; 12.2 percent of all user sessions studied consisted entirely of 
errors.^ 

Dickson^ and Taylor^^ analyzed the monitoring record of search 
input on the NOTIS system that resulted in no matches on known-item 
searches. Dickson found that 37 percent of all title searches and 23 percent 
of all author searches resulted in no matches. She determined that 39.5 
percent of the no-match title searches and 51.3 percent of the no- match 
author searches were for records that existed in the database and were not 
found due to user errors in searching. Of the errors in title searches, 15 
percent could be attributed to typos or misspellings; the ^-emaining errors 
were conceptual in nature. 

Taylor found that only 22.4 percent of the no-match author searches 
could be determined to be good author names that were not in the database; 
the remaining 77.6 percent could have been for records actually in the 
database. She was able to attribute 22.1 percent of the no-match author 
searches to misspelled words.^ 

Conceptual Aspects of Searching 

Bibliographic Retrieval Systems 

While problems v;ith system mechanics are rare for both experienced 
and inexperienced rearchers of bibliographic retrieval systems, many 
studies have identified significant problems wit!: search strategy and out- 
put performance.^^ Experiments using transcripts of search behavior have 
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shown that searchers often miss obvious synonyms or fail to pursue strate- 
gies likely to be productive/^ Similarly, searchers often fail to take advan- 
tage of the interactive capabilities of the system. 

In a survey comparing searching problems to prior training, Wanger 
et al. found that most respondents said they had difficulty in developing 
search strategies **some" (47 percent) or "most*' (8 percent) of the time and 
36 percent said they had difficulty in making relevance judgments **some'* 
of the time/^ 

Perhaps as a consequence of relying primarily on simple search tech- 
niques, recall scores are often relatively low even when comprehensive 
bibliographies were requested/^ In reviewing studies that computed recall 
measures (using a variety of research methods), Fenichel shows that aver- 
age recall ranges from a low of 24 percent (novices only; 41 percent average 
minimum recall in other cases) to a high of 61 percent. Average precision 
in the same set of stud'is ranged from 17 percent to 81 percent.^^ 

Online Catalogs 

The online catalog studies also have identified many problems with 
the conceptual aspects of searching, although they have focused more on 
problems related to misunderstanding of system features than to achieving 
high levels of performance. Similar to FenicheFs findings,^^ survey data 
indicate that online catalog users rarely ventured beyond a minimal set of 
. system features. The majority of searches were simple, specifying only one 
field or data type to be searched; the advanced search features were rarely 
used; even when systems included the feature of scanning lists of index 
terms or headings, users didn*t utilize the feature unless ''forced" to do so. 

Survey respondents also indicated that they had problems with several 
of the conceptual aspects of searching, including increasii.g search results 
when too litde (or nothing) is retrieved, reducing search results when too 
much is retrieved, and use of truncation. Users reported that they expe- 
rienced a lack of control over the search process and that they found many 
Ot the codes and abbreviations in the displays confusing."*'^ 

In assessing problems with specific types of searching, the survey 
found that subject searching was the most problematic area. Users indi- 
cated ihzi they had problems both with performing? the subject search and 
with identifying the right subject terms. In several monitoring studies 
reviewed by Markey,^^ no-match subject searches range from a low of 35 
percent on MELVYL^® to a high of 57 percent in the BAGS system.^^ 

In the monitoring study conducted by Dickson, no-match searches 
could be attributed to misunderstanding the search structure, such as 
inclusion of initial articles (10.1 percent of no- match title searches), wrong 
name order (12.6 percent of no-match author searches), and the wrong 
forename or the incorrect inclusion of a middle initial (9.9 percent of vhe 
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no-match author searches). Taylor found that 16.7 percent of no-match 
author searches were due to putting the forename first, another 5.6 percent 
to the incorrect use of a middle initial, and 5.7 percent were due to 
searching title or subject terms in the author field. 



CONCLUSIONS 

We have discussed the applications of psychological theory to the 
design of information systems— including mental models, information 
processing models, and individual differences— and studies of error behav- 
ior on both bibliographic retrieval systems andonline catalogs. What does 
ail of this imply for making systems more user friendly or transparent? 

Implications of Psychological Theory 

The results of the mental models research suggest that systems are 
easiest to use when they are designed around a consistent conceptual model 
that is readily recognizable by the user. Further, the training and instruc- 
tions for the system should reinforce the model. Status indicators on the 
display should indicate the current location in the system, the immediately 
previous location, and options for the next location. All of these data are 
helpfuLin providing a comfortable framework for system use. A transpar- 
ent system, in terms of a mental model, is one whose conceptual framework 
is readily adopted by the user, making the system simply a tool to support 
the task and not a task in itself. 

The information processing models have less direct implications for 
user friendly systems design. They suggest that user actions can be quanti- 
fied into a string of additive variables, including reaction time, keystroke 
time, and mental processing time. Therefore, through system evaluation 
and basic research, we should continue to seek some underlying funda- 
mental characteristics of information retrieval behavior. The practical 
results of information processing models' research probably are further 
away from implementation than are the results of other research paths. 

The individual diffeiences research suggests is that different people 
approach systems in different ways, learn at different rates, and prefer 
different types of training and interfaces. The first step in implementing 
the results of individual differences research is to acknowledge that the 
differences exist. When user populations are small or otherwise well- 
defined, it may be possible to identify common characteristics (e.g., conri- 
puting knowledge, retrieval knowledge, subject expertise) and tailor 
systems accordingly.^^ When user populations are diverse and ill-defined 
(as is the case with most populations of public and academic library 
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clientele), individual differences can be acknowledged by providing multi- 
ple forms of interfaces (e.g., menu and command) and by offering multiple 
forms of training (e.g., classroom training, computer-assisted instruction, 
printed materials). The provision of options such as these, while not 
allowing precise tailoring to each individual, does allow users to make 
choices among the interface styles and training methods with which they 
are most comfortable. 

Error Behavior and Transparency 

A review of the research on error behavior suggests that users have 
problems with both the mechanical and the conceptual aspects of search- 
ing information retrieval systems and that the problems occur on both 
bibliographic retrieval systems and online catalogs. We are beginning to 
identify some of the problematic factors, although they vary by system. We 
do know that subject searching tends to be the most problematic type of 
search in most systems, however, and a candidate for closer study. Another 
common factor is the tendency to utilize only a subset of commands, not 
taking advantage of the more sophisticated searching features. We need to 
determine if the higher-level commands are not taught adequately, are 
difficult to implement, or are simply unnecessary for most users. Most of 
all, the results of error-behavior studies suggest the need for continual 
eval uation of systems so that the problems can be identified and the systems 
improved. 

Future Research 

Information systems have not yet reached the stage of being user 
friendly for most of their users. We now know enough to begin to charac- 
terize the problems; much more work is required to find solutions for them. 
We need both design guidelines to alleviate known problems and basic 
research to identify general principles of user behavior. The init'^1 
groundwork for a psychology of human-compu . .havior has been laid 
and research methods exist to continue the wt a -A base of implemented 
systems, available to a variety of user populations, exists for study. With 
sufficient devotion to research, we may soon have a class of ''usei friendly 
retrieval systems." 
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Is ^*User Friendly*^ Really Possible 
in Library Automation? 



Everyone wants to have a ''user friendly" system, but no one can really 
define what user friendly is. A friend suggests that we should be talking 
about ''user seductive" rather than user friendly. Tlie person using an 
online catalog should perceive a natural relationship and not have to stop 
to think about his or her interaction with the computer. The truth of the 
matter is that an online catalog is so much better than a card catalog that 
library patrons will put up with a lot of "unfriendly'* things in an auto- 
mated system. 

The term user friendly has become a buzzword. Everyone would 
probably agree that online library systems should be approachable. How- 
ever, despite efforts to make system use easier, many first-time users still 
feel intimidated. The major cause of user fear may be the everyday jargon 
used by those persons who are the corporate keepers of the Holy Grail— 
i.e., automated library systems. Often the words used in discussing online 
systems are overly exprcss've and needlessly violent in tone. Even the term 
user fits this situation because it sounds drug related rather than library 
related. 

Another example is one that probably causes Reverend Jerry Falwell 
consternation— i.e., "abort, retry, or ignore." Why not use ''quit, try again, 
or bypass" instead? Or another example is a multitasking communications 
program that goes into a "deadly embrace" and so informs the person at 
the terminal. 

The key is to simplify dialogue and use everyday terminology instead 
of jargon. It is just as easy to say "program error" as to say "bug." 
"Running" a program sounds better than "executing" it. And a "system 
failure," no matter how unwelcome, is not nearly as intimidating as a 
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**crash" which sounds positively life threatening. Where docs the compu- 
ter world get so much emotionalism? 

Other examples of misuse of the language are prevalent in the compu- 
ter world. The response should be to promote proper usage and decry those 
who use meai:ingless or ambiguous words. A need exists to step back and 
not lake ourselves too seriously. If this process does not occur, a large 
segment of library patrons wil! be resistant to using automated library 
systems effectively and efficiently. 

Libraries with automated library systems have found that patrons 
naturally gravitate to the terminals of the online catalog because of their 
belief that a "computerized" catalog provides better acctrss to the library 
materials that they wart to use. This situation is imporuint in a society that 
places value on innovation, speed, and ease of use for all services whether 
they be in libraries or other places. User sophistication is constantly 
growing through increased access to automated library systems. Expecta- 
tions of ease of use are also increasing. 

The online catalog should assist experienced and inexperienced users 
by revealing its inner working organization. A person should beable to use 
an online catalog without knowing its structure and without regard to age, 
education, experience, or sex. Minimal instrution sliould be required (less 
than fifteen minutes). 

User friendliness is based on positive interaction between the compu- 
terand the terminal user. Human conversation provides the best model for 
this huDian-computer communication. The problem-solving dialogue 
component of conversation is particularly applicable to human-computer 
applications. In the problem-solving, task-sharing activities of online 
catalog use, something close to symbiosis between the person and the 
computer must be attained. A close union and a result-oriented interde- 
pendence must be established. Several properties of dialogue make it a 
good form of communication for human-computer activity. Stewart, in 
his article "Communicating with Dialogues," reminds us that: "A dia- 
logue is by definition a two-way process....[It] involves the sharing of 
knowledge by the exchange of information." Successful dialogues accord- 
ing to Stewart have five requirements: 

1. Both parties must be able to send and receive information without 
undue constraint. 

2. The language chosen as the carrier of the dialogue must facilitate the 
expression of subtle or complex ideas. 

3. Accordingly, the interpretation and understanding of the language — its 
symbols and rules— must be shared to the same degree by both parties. 

4. Each party must be able to grasp the context of meaning of the other 
party and siiccessfully follow or "keep up" when that context shifts or 
changes. 
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5. This, in turn, requires being responsive to the feedback of x\\q other 
party and ''modifying the communication so that it is letter suited to 
the other party/'* 

Tlie literature contains much information about the general require- 
ments for effective human<omputer communication. Perhaps the experi- 
ence with online catalogs is best explained by tlie observations of Hayes, et 
al. in an article "Breaking the Man-Machine Communication Barrier/* He 
states: 

Simply put, today's systems are not very good at conmnmicating with 
their users. Hicy often fail to understand what their users want them to 
do and then arc unable to explain the nature of the misunderstanding to 
the user. In fact, it is the common experience of users of interactive 
systems, whether novice or cxpericnaxl, infrequent user or regular, that 
communication with their machines is a time consuming and frustrat- 
ing experience. 

Why does this barrier exist? According to Hayes, tlie computer "lacks 
the basic communication skills that come so easily to almost all of ub.*' 
Mutual understanding is a necessity in good dialogue. Partners \\ 
dialogue must be ready to repeat, simplify, or otherwise clarify the mes- 
sage. The timeliness of message acknowledgment and responj?e is also 
important for effective dialogue as explained by Hildreth in his OCLC 
Technical Report ''Optimal Response Times in an Online Interactive 
Computing Environment."^ 

Since language is the medium of message transfer, every effort nuisi Ik* 
made to use semantics and syntax that facilitate the dialogue between the 
computer and the user in terms of vocabulary, sequence, and conse- 
quences. Tlie following things are important when considering ^*»mantics 
and syntax: 

Semantics 

1. Completeness: the set of commands provided must be Gipable of evok- 
ing all the functions required in the user's model of the (ask domain. 

2. Functionality: the operational interrelationships among (he com- 
mands must be flexible enough to be ccmpatible with a variety of user 
activity patterns in online information retrieval. 

3. Singularity: each function should be represented by only a single, 
distinct command, and no command should be capable of <^voking 
muhiple functions. 

4. Modularity: a basic, simple subset of the language, capable of evoking 
the fundamental functions, should be provided for the new or occa- 
sional user. Tlie design structure must l>e hierarchical, permitting the 
learning and use of specialized extensions to the basic subset todevelop 
in a linear manner. 
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5. Variety: extensions to the basic command core representing specialized 
and more sophisticated features should be provided for the experienced 
user. 

6. Linearity: the structure of the basic command core and its extensions 
should permit the user to proceed in a left to right, top down, manner. 

7. Opiionality: alternative ways of expressing commands during input 
should be supported consistent with the syntax criteria listed later. This 
would include the use of abbreviations, flexible argument order, and 
user designation of default values for commands and parameters. 

8. Simplicity: any complexity introduced should result only from the 
extension of the basic command core to include more powerful or 
refined capabilities and not from the meaning of the commands or the 
syntax governing their use. 

9. Consistency: command words should have the same meaning in all 
contexts, and syntax dv.cisions should be applied uniformly throughout 
the language. 

Syntax 

1. The structure of the command statements should be patterned on 
familiar, natural language phrase structures. 

2. Command keywords should be short, familiar words chosen from the 
user's nati'ral language and should clearly express the specific action 
being commanded. 

3. Self-evident abbreviations should be permitted but governed by easy to 
remember abbreviation rules. 

4. Punctuation required for command construction should be kept to a 
minimum and be limited to universally familiar symbols. Blank spaces 
should serve as delimiters. 

5. The positioning of arguments following a command word should be 
flexible and not governed by a predefined, rigid ordering of 
components. 

6. Command syntax should be compatible with display syntax. 

7. Default states for commands and parameters, as well as the capability to 
set or reset them, should be provided for the user. 

8. The user should not have to learn specialized keyboard techniques. The 
use of special function keys or control key combinations should be kept 
to a minimum. Eniry in either upper or lowercase should be permitted. 

During the past five years several research studies related to online 
catalogs have been completed. Most of these studies are useful in providing 
insight into patron attitudes about online catalogs. Virtually all of this 
research has shown that both users and nonusers have a positive feeling 
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about online catalogs. Unfortunately, these same studies do not provide a 
clear picture of what really constitutes a user friendly online catalog or 
even what causes a user to express satisfaction and be positive. No research 
to date has conclusively identified online system features required to 
"optimize'* the human-computer interface. 

As Hildreth states in his book Online Public Access Catalogs: The 
User Interface, online catalogs are experiencing a period of intense devel- 
opment and deployment. Consequently, the emphasis should be on inclu- 
sion of interface features that are generally accepted as user friendly 
through available evidence, if not actual consensus among designers and 
experienced users. Hildreth sums up his comments by stating: "The need 
now is to improve the interface, not optimize it once and for all time."^ 
From a technical point of view, online catalogs are at a stage when software 
development techniques must ' atch up and keep pace with what is known 
about good dialogue between persons and computers in particular and 
user-oriented interfaces in general. Gaines, in his article "The Technology 
of Interaction," reports that there is not yet agreement among software 
designers as to what constitutes good dialogue programming. He states: 
"We are in a position today in programming man-computer interaction 
that we were with hardware design thirty years ago and sofv vare design ten 
years ago."^ 

In addition to the human-computer communication, environmental 
considerations also affect the ease of use of an online catalog. Considera- 
tion should be given to location of terminals, tables used for terminals, and 
the physical characteristics of terminal design. However, experience with 
online library automation systems indicates that color display, graphics, 
touch screen, and monitor display color (amber, green, or monochrome) 
are of minor concern. Other important considerations are level and com- 
pleteness of documentation, display of diacritics, and invisibility of hard- 
ware. The provision of comprehensive "help" modules, which are easily 
accessible and contact sensitive, is important to a user friendly environ- 
ment. This "help" should include user requested help, system defined 
help, automatic response prompts, and informational messages. 

In the area of user friendliness for handicapped library patrons, the 
University of California's Division of Library Automation has developed a 
"talking terminal" that enables blind and visually impaired users to use 
the MELVYL Online Catalog without assistance. The terminal unit incor- 
porates a keyboard labeled with large letters and braille, a scieen display 
that magnifies the image, and a digital voice synthesizer. The terminal 
responds to users by speaking, by providing a tape from a built-in cassette 
tape recorder, and by providing a display up to sixteen times its normal 
size. One dra wback is the current cost— which is $1 0,000— plus the cost of a 
braille printer if needed. 
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Another approach to user friendliness is to give terminal users the 
opportunity to comniuni»,ate questions, comments, and observations. The 
LIAS system at Pennsylvania Slate University uses the OPPS command, 
which allows any user to enter a free-form message. These messages are 
periodically printed out for analysis and for appropriate reaction or 
response. This command can.be easily used without any intervention of a 
staff member. One limitation is that it lacks a means of direct response to 
the person who initiated the command. Thus there is noway to respond to 
a request for information. 

Based on observation and discussion with users of onlinecatalogs, the 
following criteria must be considered in developing a user friendly online 
catalog interaction with the user: 

1. Be conversational but instructional at the same time. 

2. Use consistent format and terminology. 

3. Use mnemonic devices whenever possible. 

4. Use formats which facilitate understanding of information present- 
ed on screen. 

5. Eliminate as many steps as possible. 

6. Be positive in response statements. 

7. Be forgiving of errors in entry. 

8. Accommodate many user punctuation and spacing inputs. 

9. Provide computer response time averaging three seconds or less. 

The current emphasis by the whole library automation industry is to 
make systems more user friendly. This focus is underlying all current 
marketing and design efforts, whether in the area of hardware— such as 
terminals— or in the area of the software which runs the system. User 
friendliness is an ever present force in library automation today. Within 
the next five years, we will wonder why we ever did things in the old 
laborious way— i.e., keying in commands. This process will all be super- 
seded by voice commands or by a mixture of voice and graphics. Weare still 
some years away from one person's dreams— a truly portable terminal 
which can be taken out jogging. 
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User Interfaces for Online Library Catalogs 



As confiputer systems moved out of the laboratory, it became apparent that 
many of them were intended to be used by people who had no knowledge of 
programming or understanding of computer systems. It soon became 
necessary to devdop more congenial ways for people to interact with 
systems, and the whole notion of the "computer-user interface" or the 
"man-machine dialogue" was bom. 

In general, the complexity of a computer system can be categorized as 
shown in figure 1. The figure illustrates how systems complexity increases 
depending on the number of programs or modules it contains and who 
will use it. The simplest case, of course, is a single program written by one 
person for his or her own use. System complexity is increased by two major 
factors: (1) if the system includes more than one progranfi which must work 
together, and (2) if the system is to be used by someone other than the 
programmer who wrote it. Obviously, programmers can get away with 
leaving an error message that says something cryptic like "zero-length 
response not allowed" or worse, but if a computer system is to be used by 
others, the instructions for using the system and any error messages that 
inay be necessary must be clear to the user. 

Online library systems are a classic example of complex systems of 
programs designed for use by others. Library systems are made up of many 
intricate programs with complex relations among them. The library data- 
base consists of numerous files which are interrelated in various ways. 
Library users— both staff and patrons— need to perform a large number of 
complex actions using these programs and files. The challenge to the 
library systenr»s designer is to create an effective user interface for this 
environment. 

Early attempts at developing effective interfaces assumed that it was a 
relatively trivial matter to design a few menus and other screen displays to 
guide the user through the system and provide assistance when needed. 
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Figure 1. How Computer Systems Become Complex 



Many bad experiences have shown that designing effective user interfaces 
is neither simple nor easy. The process of developing a new user interface 
may appear to be simple because the best interfaces are simple and easy to 
use. But this very simplicity is deceptive. It usually means that under the 
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surface, some very sophisticated language parsers, menu displays, and 
command syntax structures are in place that effectively hide all the system's 
complexities' from the user. 

Another difficulty facing designers of user interfaces is that they are 
usually in the position of creating a so-called "user friendly front end" or 
interface to an existing system. Although cleverly constructed menus and 
screen displays can help to make a system more user friendly, some of the 
problems that arise may be caused by a poorly designed underlying system. 
These difficulties may be almost insurmountable. For example, some 
online systems use a derived search key (e.g., first four letters of the author's 
name, followed by a comma, followed by the first three characters of the 
first significant word of the title). Explaining how to conr^ruct this search 
key to the novice user in a friendly fashion is virtually impossible. The best 
solution may be to develop a parser that analyzes the user's search request 
and constructs the search key. This approach may not be entirely satisfac- 
tory because the search results produced by the derived key may not be at all 
what the user wanted. 

Perhaps in the future some new library systems designer will think 
about how users ought to be able to get information out of an online 
library system and then will create a system to support it. Apple took this 
approach with the Macintosh personal computer. Computer designers at 
Apple had many new ideas about how users ought to be able to work with a 
personal computer and thej then set about developing a machine to allow 
these new approaches. Cambridge Library System's original touch screen 
interface to the library online catalog was an early attempt to provide such 
a new mode of access. As is well known now, this effort was pauially 
successful. It was well liked by users in small libraries but found to be too 
slow and too cumbersome by patrons in large libraries. Nevertheless, it was 
a brave attempt at finding an entirely new approach to online library 
catalog access. Similarly innovative efforts must be encouraged in the 
future. 

Features of a User Friendly System 

A user friendly system has a few well-known characteristics. These 
may be implemented in quite different ways, but the results as far as the 
user is concerned will be quite similar. The main features are: 

—Users cannot get **lost'' in the system. The system always let? the user 
know what is being done and offers suggestions about what to do next. 
There are no surprises. The system does not, at the touch of a finger, go 
off and perform some action that is totally incomprehensible to the user. 




61 



USER INTERFACES 



55 



--Users cannot enter illegal commands. The system prompts for certain 
user actions. If the anticipated choice is not made, the system does noth- 
ing or prompts again for the expected data perhaps at the same time sug- 
gesting an appropriate choice. 

-—Users can choose among several different simple modes of data entry. 
One keystroke or perhaps pressing the enter key is all that is required of 
the user. Some systems that are equipped with a mouse or other pointing 
device allow users to enter data using the device or using the keyboard. 

--Users are in control The user is given encagh information about the 
system to understand how it works and how to make it do what the user 
wants. The user has a mental model of the system. 

User Interfaces for library Applications 

Online library systems need good user interfaces to be successful. 
Although the basic concepts of online information retrieval are relatively 
simple, the features of many online catalog systems are not simple. 
Further, the library card catalog itself is not simple or easy to use. Thus an 
effective user interface for an online library system has to deal with com- 
plex record structures, data types, and files and it must be a comfortable 
tool for the occasional library user as well as the experienced staff member. 
This is asking a lot. The specifications for most libiary user interfaces have 
a built-in multiplicity of functions that is seldom questioned. For 
example: 

— Is the user interface intended to teach users how to use the online catalog 

or how to use the library or both? 
—Is the user interface intended for use by library patrons or staff or both? 
—Is the user interface intended to make the system easy to use or easy to 

learn or both? 

—Is the user interface intended to serve the infrequent or casual user, the 
expert user, or both? 

If the answer is **all of the above" it is no wonder that it has proven difficult 
to de\'elop user interfaces for online library systems that are adequate. 

Modifications to the NOTIS Screen Displays for PennLIN 

The Northwestern ^^nline Total Information System (NOTIS) soft- 
ware is being installed at the University of Pennsylvania to form the heart 
of PennLIN, Penn^s Library Information Network. As part of this process, 
the N0T7.S screen displays are being modified to suit the Penn 
environment. 
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NOTIS is a command-driven system; that is, the underlying design 
philosophy in constructing the system is that users should be able to enter 
any command for which they are authorized at any point in the session. 
Therefore in NOTIS, the screen displays that have been developed for 
Northwestern's Library User Information System (LUIS) are really 
prompts for commands rather than true menus. One of the difficulties that 
arises is how to present all possible options to users without overwhelming 
them with choices. Another problem is how to move from the command 
prompt screens to help screens and back without confusing users. Figure 2 
shows the NOTIS screen displays and the linkages between them. 
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Figure 2. NOTIS/LUIS Screen Displays 
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Although the choice of wording on the screen displays is very impor- 
lanl in creating a user friendly system, having an underlying str icturc for 
the screens and menus that is clear to the user is even more important. It is 
this structure that serves to: 

—keep users from getting 'lost" in the system; 

—keep users in control; and 

— give users a mental model of the system. 

In a recent article, Ben Schneiderman has stated that ''the primary task 
for menu designers is to create a sensible, comprehensible, memorable, and 
schematic organization."^ He likens this organization to a breakdown of a 
larger whole ato its parts such as the chapters in a book, a catalog into 
sections, or a restaurant menu into categories such as appetizers, soups, 
and desserts. Schneiderman further categorizes menu systems into two 
main classes: (1) single-menu systems— that is, a system with one menu 
offering binary or multiple choices; and (2) sequences of menus. These 
may be linear, tree structured, or cyclic or acyclic networks. Simple menu 
systems cause few difficulties for users because they oi. jr limited options, 
but more complex menu structures require that users understand how to 
move around the system—that is, how to navigate the sequences of menus. 

Very little lesearch work has been done on the best format and content 
of screen displays. Empirical findings have led to some conclusions, but 
much more work needs to be done. Preliminary findings indicate that 
graphic design and layout of the displays are extremely important, but 
there are almost no research studies that have focused on library applica- 
tions. Opinion varies on the use of color to enhance computer displays for 
patrons or staff, but no definitive research results are known which support 
the superiority of color or monochrome displays for use by patrons or staff . 

Some conclusions gathered from observation and experience are: 

—-don't use jargon; 

—-especially don't use computer jargon; 
—use familiar and consistent terminology; 
—use consistent and concise phrasing; 

—make screens consistent in format and terminology from frame to frame; 
— use the same area *he screen to display prompts or for user data entry 

from screen to screen; 
—don't make the screens too full; 

—don't overwhelm the user with choices; five to seven options are optimal; 
— make sure that menu choices are clear and do not overlap; 
—try to eliminate error messages. Users should not be able to do anything 
"wrong*'; 
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—give users a hint about the result of menu choices— that >, there should 

be no surprises; 
—use letters or numbers for menu choices but not both; 
—letters ? e preferred for menu choices because of their mnemonic value, 

but letters must be selected with care; 
—put the most important part of the instruction at the beginning of the 

line; 

—keep tl.^ user in control. Have an ''undo" key if possible; and 
—keep it simple, remembering that what seems simple to the user may 
mean that very sophisticated processing is taking place in the back- 
ground. 

Research Topics for Developing User Friendly Systems 

As indicated earlier, there are many topics in the design of user 
interfaces that are poorly understood. There is a great opportunity for 
testing the online library systems now in use to see what works and, 
conversely, what does not and to test new ways of handling various inter- 
face problems. Some unanswered questions that need research foUo^y. 

—When specifying a menu choice, should the action precede or follow the 
explanation in the display— e.g.. 

Type A To search for an author 
or 

To search for an author Type A 
—What area of the screen display should be reserved for user data entry? 
—What is the best place on the screen to use for error messages (if any)? 
—Highlighting seems to be effective if not overused. How is it best em- 
ployed? 

—What about color displays? Is color just a frill? Does it have any use 

besides its value as a novelty? 
—What indicators on the screen (e.g., leader dots, dashes, arrows, blank 

space) work best to connect menu choices with the appropriate actions? 
—How is white space best used? 

—What is the best way to liandle a menu or screen display that is too big to 
fit on one screen? 

Specifically for the library online catalog, a few other areas in which 
research results would be extremely useful: 

— 7)2 what order should bibliographic records appear? Should it be the 
same for staff and for public users? Some online library systems display 
bibliographic citations in conventional author-title order while others 
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display the most recenvadditions to the file first. What order is preferred? 
Should users have a choice? Should user*; learn how to do online sorting 
of records? 

— What fields or data should appear for each bibliographic recordfShould 
there be a default which users may change} Research has shown that 
patrons use very little of the information that is available on the conven- 
tional library catalog card. Are all of the data elements necessary? Which 
ones should users see? Should users be able to select different levels of 
completeness? 

—What techniques work best to help users whose search strategy nets too 
few results or too many? Preliminary analysis of online catalog transac- 
tion logs shows that users whose search strategies retrieve too many or 
too few citations have difficulty in narrowing or broadening the search. 
Is this a fault of the system? Is it because users don't understand retrieval 
of St IS? A related issue is how much information is enough information 
and what is the library's responsibility to make sure users get all avail- 
able information whether they want it or not? 

— How can one determine if a search thai results in no hits represents a 
failure of the yystem or a failure of the database? Several studies have 
shown that searches of online catalogs sometimes do not retrieve the 
desired records when the records are in fact in the database. Sometimes it 
is difficult to determine if this is the fault of the system or the fault of the 
search strategy. In other cases, the desired records really are not in the 
database. What tests can be made to increase confidence in the retrieval 
effectiveness of online systems? 

— What is the best way to explain the basic idea of retrieval of sets to library 
users? Analyses of online catalog transaction logb and user comments 
reveal that some patrons and staff members really do not understand the 
concept of retrieval of sets of citations from an online database. Some of 
these same users have had a formal introduction to set theory, but 
apparently there is no carry-over to the library application. How can the 
interface give these users a good mental model of the online database so 
that they will truly understand how the system works? Should online 
catalogs emulate the card catalog? Are there other models that are more 
appropriate? 

Conclusion 

User interfaces have come a long way since the earliest attempts to 
design user friendly systems. Few of today's systems resort to the withering 
''illegal command" or other decidedly unfriendly responses. Most of the 
new systems are no longer user hostile or user vindictive, although some 
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are still rather opaque and require that users learn more about the system 
than most ofthem ever wanted to know. Still, thousands of happy users use 
hundreds of these systems every day, but there is still a long, long way to go 
before online systems Jire truly user friendly. 

In the future, advances in artificial intelligence and expert systems 
may offer a path to creating user interfaces that are more natural; that is, 
they will allow queries to be posed in natural language and help users to 
select the most appropriate sources for searching. But in the near term, if 
systems designers applied the empirical results gleaned from online 
catalog use and other studies that are available already, online systems 
would improve. More information about how users would like systems to 
operate and a much better understanding of the fine art of designing 
screens and constructing menus \vill help to make online systems much 
more acceptable to users. Perhaps that is what wc really mean by user 
friendly. 
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Taming ^he Unfriendly System: 
Microcomputers as Patron Terminals 
to Access an Online Catalog 

Two concepts capture the spirit of this discussion. The first is a quote by 
Marshall McLuhan who said about technology: "IC it works it's obsolete."^ 
The other is Finaglc's Law Of Information which states: 

The information you have is not what you warn 
Tlic information you want is not what you need 
Tlic information you need is not available.^ 

/lutomation in libraries has been on a rapidly moving roller coa :r 
over the past decade. At first the major concerns were whether to automate 
with the few existing vendor systems. Many libraries designed their own 
systems while others shopped around or waited. Over Ihe past five years 
many turnkey systems promising an automation nirvana for libraries were 
developed. This conference and the growing body of literature sliift this 
emphasis toward making these systems easier to use. An even newer con- 
cern is how to develop integrated systems or "information gateways" to 
allow access to expanding internal and external databases. 

Tlie rallying cry for librarians is not whether to automate, but can we 
automate our many internal processes and access external systems from the 
same terminal. Automation, however, takes place within the constraints of 
limited money. Tlie question becomes. How can we allow all patrons easy 
and quick access to a new world of information choices? 

We have slowly— and sometimes quite painfully (in terms of money 
and staff morale)— come to the realization that automation might not be 
the panacea. The programming proverb "Garbage in— garbage out** is 
haunting many a library committee as its members wrestle with past 
cataloging and classification inconsistencies. Automating our catalogs or 
':irculation functions means users must develop entirely new habits in 
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their quest for information. In addition, making only our catalogs avail- 
able online is not enough for many patrons. On many occasions we have 
found patrons attempting to locate citations to journal articles in our 
online cataiog. When told they couldn't do this, their most common 
responses were Why? or I thought that computers could do this. 

Searching these online catalogs is another problem area. In many 
searches more than one patron-initiitcd action is necessary to find a 
citation to a specific book (e.g., a title search finding many matches might 
need narrowing). Rememberinga longset of commands or different search 
strategies for authors, titles, or subjects is difficult for patrons who are 
infrequent library users. Having thousands of new freshmen, graduate 
students, and faculty needing assistance each year can strain an already 
difficult bibliographic instruction program. The cost in staff morale is 
immeasurable. 

In user friendly systems, symbols are explained and patrons are led 
through this maze of commands. Many librarians need help with systems 
that ate not friendly or friendly enough. How can they improve these 
systems? Is v le a system flexible enough to allow access to numerous 
systemsat iht same time? The answer toall these questions is microcompu- 
ters. Before proving this to you, I would like to present an overview on the 
direction of catalog and circulation automation within Illinois academic 
libraries. 

Automation in Illinois 

At present there isan automated short-record circulation system called 
LCS. LCS is the largest cooperative resource sharing network located 
within a single state Every academic library desiring state money for 
automation must join LCS. There are twenty-seven member libraries 
including every four-year state-supported university and ten private insti- 
tutions like DePaul and Judson College. As of April 1986, they have input 
8.9 million titles and 15.4 million volumes into an IBM 3801 computer 
located in Chicago. 

Members agree to share any circulating item with patrons associated 
with other member institutions. Delivery of materials takes seven to ten 
days and is through the Inter-System Library Delivery Service, an Illinois 
State Library funded operation. Each ' 'brary has i'.- wn separate database, 
but every one of the 637 terminals can search each database individually. In 
FY86 there will be over 35 million searches on the system and over 300,000 
interlibrary circulation transactions. Since the network's inception a little 
over iive years ago, LCS schools have borrowed from each other over 1 .25 
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At the University of Illinois at UrbarvChampaign (UIUC), LCS is a 
kno*vn-iiem circulation system. The system came from Ohio State Univer- 
sity (OSU) in the late 1970s and is totally different from the OSU system in 
record appearance and searching capabilities. Library locations, call 
numbers, and circulation status are given for every book or journal ever 
cataloged on the UIUC campus. In addition, serial holdings and order 
records are also included in LCS. Searches by author, author-title, title, 
specific call number, and a general call number range are possible. 

A searching algorithm consists of a three-letter command followed by 
a slash and then a six-three or four-five combination of letters from the 
author s name or item's title. Books written by Kate Turabian are found in 
LCS by entering AUT/TURABIKAT followed by a carriage return. There 
are six different »hree-letter commands and four two-letter commands used 
to manipulate individual screens. In all there are ninety-six permutations 
of searches using these commands. In addition, there are 107 stopwords to 
remember not to use if they appear anywhere in a title or author's name. 
After getting a record the patron must know the symbols for locations and 
there are over 120 of these symbols. Searching another LCS school, patrons 
must put a two-letter code at the end of the command and then search the 
twenty-six databases separately. That could involve keying in sixteen 
characters twenty-six times (e.g., to search for Turabian at DePauI one does 
AUT/TURABIKAT/DP). 

To make things more confusing, we have in the past year brought up 
our Full Bibliographic Record system or FBR for short. FBR contains 
materials cataloged in UIUC since we joined OCLC in late 1974. There are 
almost 1 million bibliographic records and over 2.3 million authority 
records. Included in the authority file are the Library of Congress name 
and subject heading tapes. A government grant to develop FBR stipulated 
that we include a pablic library in the development stages. The River Bend 
Library System agreed to test FBR with us. At present no LC^ school 
participates in FBR. 

FBR is searchable by subject, author, series, ISBN, ISSN, and by 
keyword title or keyword corporate authors. Boolean searches and right- 
side truncation are also possible for most searches. However, searches of 
FBR use a different command language and searching algorithm from 
LCS. There are over fifty possible searching a^'rorithms using FBR. The 
stopword list used by FBR is smaller than tha*. used by LCS. In calendar 
year 1985, over 5.5 million searches took place on FBR. After a FBR search, 
a link to the corresponding LCS circulation record must be made to find 
locations. We wanted to direct all known-item searches to LCS first 
because LCS contains records for everything ever cataloged at UIUC. 
There are 3 million titles in LCS v. only 1 million titles in FBR. 
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By the end of 1986, an ILLINET Statewide Online Catalog including 
everything cataloged via OCLC by every library in Illinois will exist. This 
catalog vail contain over 3.4 mi?lion unique titles from more than 280 
libraries. LFsing FBR searching algorithms, a user at any terminal will first 
search the local database and then will be able to look for an item anywhere 
in the state. Until FBR has an interlibrary loan module, only items from 
LCS schools will have a link to a corresponding circulation record. Other 
items will be available through the old interlibrary loan method. 

Figure 1 shows some of the questions patrons must answer and under- 
stand before they can successfully search our system. Wrong answers toany 
question or not understanding the concepts behind these questions led to 
high failure rates in our online catalog. We have two command-driven 
systems that use differen searching algorithms with different siopwoids 
and access points. This also increased the failure rate. 

Every time someone wanted to search for a book or journal ihey had to 
know what they were doing or else have help from a staff member. Printed 
search aids and online help screens are available at every terminal. How- 
ever, the ;o»:ger a patron sits at any terminal the less chance that someone 
else can do a search. The longer a search takes the more frustrated a waiting 
patron becomes and the longer the queues. In addition, our suggestion box 
was filled with complaints about our system. Since we have only eighty- 
nine public terminals we needed to be sure that patrons could search our 
online catalog rapidly, yet successfully. We also had to relieve the burden 
placed upon the staff of constantly explaining every search in two different 
systems. Staff morale was at an all-time low and sinking fast. We needed to 
make a totally unfriendly system user friendly. 

Defining User Friendly 

Before developing or accepting an interface to any system one must 
define the goals of tnat interface. We had eight criteria that we wanted our 
interface to meet. They were as follows: 

1 . that any patron can easily and rapidly use the system to find most things 
looked for without having to use commands or ask for help; 

2. that patrons, in order to realize the full power of our system (i.e., being 
able to do every type of search), should be allowed to use commands 
from any temnnal if they want; 

3. that certain types of searches are not used that much and should be done 
only through the command syntax (e.g., a Boolean search of an ISBN 
and subject); 

4. that the system response rate, which was less than a second in most 
searches and not more than three to four seconds in others, cannot be 
compromised; 
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!• Which system do I search? 

LCS for known item or location/circulation information 

FBR for subject, keyword, new material 
2. What are the commands? How are they entered? 

LCS ~ AUT/ 5-3 

FBR — FA JjtttTt name first nauae middle name or initial 



3. What does the three letter location code on LLo mean? 
STX » stacks UGX » \indergraduate library 

4. How do I find call numbers/locations after searching FBR? 
Use link command 

5. What if my subject heading does not work? 
Check online authority file 

Then issue find command to get bibliographic records 
Then use link command to get LCS record 
?• How do I pick out one match from many on the screen? 
Use DSL/line number command 

Figure I. Thought Processes Necessary to Search Our Online Catalog 

5. that the interface should improve some of the shortcomings inherent in 
any system; 

6. that the interface with minor changes for location and circulation codes 
should be usable in all of the other LCS libraries; 

7. that the interface's development be evolutionary and easy to change and 
update as the capabilities of the system changed; and 

8. that the interface will not deter but assist us to incorporate future 
technological and software innovations into our system. 
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With these assumptions in mind, we liecided there were two ways to 
implement a user friendly interface. One method involved the develop- 
ment of programs residing on the mainframe computer in Chicago. How- 
ever, this was not viable because of two important reasons. First, it would 
increase the amount of communication between the mainframe computer 
in Chicago and hundreds of terminals around the state. This would not be 
an efficient use of either the communication lines or the mainframe. On a 
computer doing 35 million searches, response time could be degraded. The 
second reason involves the political nature of any large network. Change 
happens very slowly if at all. Program changes on the mainframe are 
difficuh and time consuming because in this network, decisions for soft- 
ware updates are made by committee. The best solution to our problem of 
the unfriendly system was an interface developed and controlled at the 
local levd. 

An interface acts as a translator. The human speaks onr language, the 
computer processor another, and the interface .software mediates between 
the two. It transforms the general wishes of man info the exact commands 
demanded by the processor. We wanted these wisi.. *o be correctly con- 
veyed before they reached the mainframe. Employing intelligent terminals 
to access the mainframe was the answer. C.C. Cheng, a professor in our 
Linguistics Department, helped vs toward this solution by developing and 
writing the interface. 

The interface resides in IBM personal computers. The programming 
for the interface is primarily in BASIC with a small part in assembly 
language. These personal computers do not have disk drives and at present 
only search the online catalog. They were purchased in 1983 and at the 
time vere much cheaper than models having disk drives. The program 
transfers from a floppy disk to a cassette and then the cassette runs the 
program into the personal computers. The transfer time is approximately 
four minutes per terminal. The personal computers stay on twenty-four 
hours a day. The interface needs reloading only when a change is made in 
the prograra or the power fails. 

Cheng's premise behind the design of the interface was to program "a 
user interface aimed at capturing the natural processes of the user search 
and at providing a graceful interaction between the patron and the compu- 
ter." He wanted t*^ unlock rhe power of the online catalog but at the same 
time have the patron need only typing skills to use the system. For those 
patrons who learn how to use our online catalog, the personal computers 
are also searchable using the command language. The following examples 
detail the power inherent in using the interface. 

The interface is a menu with only five choices for searching (see fig. 2). 
All searches, except subjects, go first to LCS. If the search is unsuccessful, 
the patron could then search FBR. If the patron pressed the question mark 
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key, he or she would go through one short helpscreen. The help screen has 
paragraphs comparing LCS with FBR, information about correcting mis- 
lakes, location of the return key, and information as to how to contact 
Qieng with suggestions. Since the interface is self-explanatory, this was 
the only help screen necessary to use the online catalog. The numbers in 
the upper right-hand corner are a real-time clock. This clock assists stu- 
dents in getting to class on time and may help tosp>eed up users' time at the 
terminal. 



INTERFACE READY ( SEE THE BOTTOM LINE TO SEARCH; PRESS <?> 



PRESS 1 FOR AUTHOR-TITLE 2: TITLE 3: AUTHOR 4: CALL NO. ETC. 



Figure 3 presents a typical title-search sequence. What the patron 
enters is shown in quotation marks for illustrative purposes. A normal 
search of our online catalog does not need quotation marks. What the 
interface actually sends to the mainframe is in small letters. This does not 
actually appear on the screen during a search. The patron answers yes if 
looking for a periodical. LCS has the ability to limit search results to serials 
only. The patron then is asked to give the firstandsecond words of the title. 
If any of the 107 stop words are input, the computer beeps and instructs the 
user to replace that word with another word. The interface takes the words 
input by the patron and puts them into the appropriate LCS searching 
algorithm (e.g., TLS/PETEPRINC). This command goes to the main- 
frame computer. Two matches return and the patron is asked for a line 
number (e.g., "2") and the interface does the LCS command DSL/2. An 
explanation of what appears on the screen can be seen by asking foi help. A 
patron can then charge out the book. We allow patrons to charge their own 
books from every library on and off campus. Patrons can also search other 
libraries, and the interface then automatically searches the databases of all 
the other twenty-seven LCS schools. The searcher does not have to 



FOR HELP. ) 



5: SUBJECT 



Figure 2, First Screen of the Interface 
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remember the two-letter codes or input the sixteen characters. The inter- 
face instructs the user as how to charge the book from another campus. 
PaU-ons with invalid identification cards (IDs) or holds on their IDs are 
instructed to ar>k for help. Tl;is whole process is rc.pid and mistake proof. 



IS THIS TITLE A PERIODICAL? (PRESS <y> IF YES <ENTER> 

OTHERWISE) no 

TYPE TH13 FIRST IMPORTANT WORD OF THE TITLE AND 

PRESS <ENTER>: »»PETER" 
TYPE THE «EC0ND IMPORTANT WORD OF THE TITLE 

AND <ENTER>: "PRINCIPLE" 
(IF NONE, PRESS <ENTER>; IF UNKNOWN, PRESS <~> AND <ENTER>) 
TITLE SEARCH: PETER PRINCIPLE 

tls/petriprinc 

SEARCHING CIRCULATION RECORDS 
FOUND 2 SIMILAR ITEMS 

LOOKING FOR EXACT MATCH. ONE MOMENT . . . 



A search for books and journals on a specific subject begins in FBR 
(see fig. 4). The patron starts by typing a term or terms and can then limit 
the search if necessary (e.g., TOPIC— T, etc.). The interface puts this 
request into the proper search algorithm and searches the authority file 



BEGINNING OF SEARCH 



Figure 3, Sample Title Search (part 1) 
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01 PETER, LAUREKCE JOHNSTON, 1919-THE PETER PRINCIPLE$NY 1970 

02 PETER, LAURENCE JOHNSTON, 1919-THE PETER PRINCIPtS$NY 1969 
THESE ITEMS MATCH YOUR REQUEST 

TYPE A LINE NUMBER TO SEE CALL NTOBER OF THAT ITEM AND TO 
CHARGE, RENEW, ETC.; OR PRESS < ENTER > TO GO ON 

It 2 II 

dsl/2 

653P441P PETER, LAURENCE JOHNSTON, 1919-THE PETER PRINCIPLE$NY 
922826 1969 2 ADDED: 780702 

01 001 16-4W STX RNEWD 860212/860427 UC 

02 002 16-4W CRX 

DO YOU NEED HELP WITH THE SYMBOLS. PRESS<Y>IF YES 

<ENTER>OTHERWISE 

"Y*' 

THE CALL NUMBER IS 658P441P. THE LIBRARY HAS 2 HOLDINGS 
LINE 01: COPY 1 IS IN STACKS— CIRCULATION DESK/STACKS 2ND 
FLOOR LIBRRARY* 

THE LOAN PERIOD IS 16 WEEKS FOR FACULTY " 4 WEEKS FOR OTHERS 
BUT IT IS CHARGED OUT AND RENEWED. AND IT IS DUE APRIL 27 
1986 

LINE 02: COPY 2 IS IN COMMERCE — 101 LIBRARY 

THE LOAN PERIOD IS 16 WEEKS FOR FACULTY - 4 WEEKS FOR OTHERS 

Figure 3. Sample Title Search (part 2) 

(i.e., B ST for browse subject topical). The results in this search do nCv 
match what the person want^. The interface then will try to search the 
general authority file (i.e.^ not just for topics but also geographical, per- 
sons, etc.). Failing to find anything, the interface automatically does a 



ERIC 



76 



10 



GARY GOLDEN 



PRESS <C> TO CHARGE OUT,<R> TO RENEW, <S> TO SAVE,<ENTER> TO 



WISH TO SEARCH OTHER LIBRARIES? (PRESS<y>IF yES,<ENTER> 
OTHERWISE 



TYPE AN INSTITUTION NAME, CITY NAME, LIBRARY CODE, OR <ENTER> 
FOR ALL 

TRITON COLLEGE 

t Is/pet eprinc/tc 

DEPAUL UNIVERSITY 

tls/peteprinc/dp 



keyword title search (F T for find title). We took this approach because of 
the problems inherent in subject searching.^ The results of the search in 
figure 4 were two records and the patron asked to see record one. The 
searcher makes another attempt to find headings by pressing the "H" key 
and is given the relevant heading "Ballistic Missile Defenses." The inter- 
face would then go back into the authority file under a new subject heading 
"Ballistic Missile Defenses/' This later approach finds twenty-five addi- 
tional books about "star war defenses/' The assumption being made here 
is that the first subject heading is the most imF>ortant. This then leads the 
patron to additional sources. Vv hen the user needs to link to an LCS record 
for circulation information, the interface takes care of this too. 



GO ON 



Figure 3. Sample Title Search (part 3) 
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BEGIN, TYPE A TERM AS GENERAL AS POSSIBLE TO DESCRIBE THE 
SUBJECT: 



MOMENT. ) 

"STAR WARS DEFENSE" 
TYPE A MORE SPECIFIC WORD OR PHRASE AND <ENTER>* 
( PRESS <ENTER> IF YOU AREN'T SURE OF THE WORDING. ) 
PRESS <ENTER> FOR ALL OR PRESS THE CORRESPONDING KEY IF THE 
SUBJECT 

IS ABOUT A TOPIC— <T>, PERSON— <P>, CORPORAaE"-<C>, OR 

GEOGRAPHIC AREA— <G> 
SEARCHING THE FULL BIBLIOGRAPHIC RECORDS OF THE HOLDINGS 

ACQUIRED SINCE 1975 
b st star wars defense 

;.UTHORITY DISPLAY 
X. STAR WAR FILMS— JtJVENILE LITERATURE 

2. STARA PAZOVA, SERBIA 

3. STARAIA RUSSIA (R.S.F.S.R) —MUSEUMS. 

4. STARBORN MAGIC MUSHROOM 

5. STARCH 



The interface is menu-driven with an interactive dialogue benveen the 
user and the personal computer. An analysis of research on user aspects of 
computer designs proved to us the follo\;ing: 

1. a menu dialogue should be employed when the command set is so large 
that users are not likely to commit all commands to memory; 



(YOU'LL BE ASKED TO PROVIDE MORE SPECIFIC INFuRMATION IN A 



—MARKETING. 



--PERIODICALS 



Figure 4a. Example of a Subject Search (pan 1) 
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8. 



STARCH IW MEDICINE 



9. 



STARCH INDUSTRY 



10. 



—EQUIPMENT AND SUPPLIES 



THESE ARE THE CLOSEST SUBJECT HEADINGS. DO ONE OF THE FOLLOWING 
TYPE A NUMBER AND <ENTER> TO SEE THE CORRESPONDING 

BIBLIOGRAPHIC RECORDS 
PRESS <B> TO BROWSE— TO SEE MORE HEADINGS 
PRESS ^E> TO END THIS SEARCH 

PRESS <I> i'OR AN INTERPRETATION OF THE SYMBOLS TO THE LEFT OF 
THE HEADINGS 

IF THESE HEADINGS ARE NOT SATISFACTORY, PRESS <ENTER> TO TRY 

ANOTHER SEARCH 

t s star wars def'^«nse 

t s star wars defense! 

TRYING TO FIND STAR WARS DEFENSE IN TITLES... 
t t star war defense 



1. BOVA, BEN, 1932- ASSURED SURVIVAL PUTTING THE STAR WARS 
DEFENSE IN PERSPECTIVE / BEN BOVA. BOSTON : HOUGHTON 
MIFFLIN, 1984. VII 343 P. ; OCMlO-780039 



2. a menu dialogue ^ihould be considered for inexperienced users because 
little training is needed; 

3. a menu dialogue should be used when at least some of the users may be 
unfamiliar with the system functions; and 

4. that the wording and order of any menu should be consistent with the 
ccmniimd language.® 



BIBLIOGRAPHIC DISPLAY 



Figure 4b. Example of a Subject Search (part 2) 
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2* SHERR, ALAN B* LEGAL ISSUES OF THE STAR WARS DEFENSE PROGRAM 
/ BV ALAN B. SHERR : BOSTON: : LAWYERS ALLIANCE FOR NUCLEAR 
ARMS CONTROL^ :i984?: 38 P. ; 0CM1I-41K172 

FOUND 2 RECORDS 

THESE ARE SHORT RECORDS 1-2. DO ONE OF THE FOLLOWING: 
PRESS <C> FOR CIRCULATIOh INFORMATION. 

TYPE A NUMBER AND <ENTER> TO SEE THE CORRESPONDING FULL RECORD 
PRESS <ENTER> TO END RECORD DISPLAY 

8 1 

BOVA, BEN, 1932 

ASSURED EVRVIVAL PUTTING THE STAR WARS DEFENSE IN PERSPECTIVE / 
BEN BOVA. BOSTON t HOUGHTON MIFFLIN, 3,984 
VIII 343 P. ; 22 CM 

BIBLIOGRAPHY : P. 342-343. ISBN 0395364051 
1. BALLISTIC MISSILE DEFENSES-- UNITED STATES 2. ATOMIC WARFARE 
3. SPACE WEAPONS 4. UNITED STATES— MILITARY POLICY I. TITLE 

PRESS <H> TO MAKE ANOTHER ATTEMPT TO FIND RELEVANT HEADINGS 
OR PRESS <ENTER> TO GO ON 

A RELEVANT HEADING ISt BALLISTIC MISSILE DEFENSES 
Figure 4c. Example of a Subject Search (part 3) 

We met these criteria by having large numbers of different commands, 
users who were inexperienced, and two systems with completely different 
functions. A microcomputer-based menu interface solved our problem by 
eliminating the need for commands. Also the two systems become trans- 
parent to patrons. The difficult thought processes pre:.cnted in figure 1 
were no longer a problem. 
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The Benefits of Microcomputers 

Using microcomputers has also allowed us to gain more than just a 
user friendly system. The benefits are applicable to any other system even if 
it was vendor developed. 

More Efjicient Use of a Mainframe or Minicomputer 

There has been a reduction in our error rates in FBR. Errors averaged 
almost 23 percent on the nonpersonal computer or dumb terminals and fell 
to only 6 percent on the intelligent terminals (see fig. 5). This meant that 
on the dumb terminals, one in every four searches resulted in an error 
message. LCS, which is not as complicated as FBR, has also seen a corres- 
ponding drop in the number of bad searches. Two other studies have found 
error rates of 1 1 and 13 percent respectively.^ 



1 PERSONAL COMPUTERS | 
1 U 


OTHER TERMINALS | 

1 


I 1 
LCS 1 1% 1 

1 1 


1 

11% 1 
1 


1 1 

FBR 1 6% 1 
1 1 


1 
1 

28% 1 
1 



ERROR RATES FROM OTHER STUDIES: 



BERKELEY 11 % 

OHIO STATE 13 % 

Figure 5. Average Error R:ues for 1985 



In addition, the load on the mainframe is balanced, Some people "!! 
be sending searches while others are still formulating iheir search strategy 
on the personal computers, This guarantees that hundreds of simultane- 
ous commands do not reach the front-end processor at the same time. Every 
search on a computer, no matter if it results in a good response or an error 
message, takes up machine resources. With 35 million searches in LCS, an 
error rate of 1 1 pei 'rent meant that over 3.5 million searches wasted compu- 
ter resources. This ievel of erroneous searches cannot bean efficient nszoi 
any machine. If the mainframe or minicomputer is not large enough to 
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handle ihe load, ihis error ratio can cause degradation of resi>onse lime. 
Even wiih FBR and the iniroduciion of more lim -consuming keyword 
searches, we have not degraded response rime. 

Transparent Interface That Doesn't Require 
Learning System Commands 

All searches, except subjects, go first to LCS because it contains 
records for everything cataloged at UIUC Unsuccessful searches a'-e then 
automatically routed to FBR. Tliis routing takes place at the looi^ level 
before the search goes to the mainframe. Commands are not input by ihf* 
patron. Stopwords are not a problem. Exj. nations are given for all codes 
found "n either LCS or FBR. Patrons are also led through a search of an 
authority fSle with cross and see references, then to the corresponding 
bibliographic record, and then to a separate database— LCS— which con 
tains circulation information. The movement back and forth between fiiR 
and LCS is invisible using the interface. ITie number of questions about 
how to use onr system has dropped dramatically. Anyone using our libniry 
can search the online catalog for a known item or subje<,t without needing 
help. Staff morale has improved tremendously. 

Fast Interface and Short Interactions with Databases 

Transmitting each line back and forth between a terminal and a 
distant mainframe can be slow. In addition to the communication dis- 
tance, there is the possibility of slow response due to overloading the 
mainframe with additional searches. With a personal co'^^outer, ni^(hing 
leaves the terminal until the search strategy is complete a.. J correct. Tlie 
initial communication is between the keyboard and the p.ogram in the 
personal computer and is therefore quite fast. Patrons know that the 
computer is working on iheir answer because the word ''searching" blinks 
on the screen until a respons'* appears. 

Adaptable and Easily Changed Interface 

It is advantageous to be able to improve the interface quickly as the 
s/stem capabilities change. A local interface, using software that is pur- 
posely easy to update, helps accommodate system clianges. It is also easy to 
test and fi*"; tune a local interface. Our interface lias gone through over 
thirty-five different versions in only three years. In addition, a micro it is 
possible to have different types or levels of an interface on different termi- 
nals. There could be one version of the interface for undergraduates and 
visitors and another not as detailed for faculty offices. 

The interface version available at UIUC allows the patron to charge 
out books. When a patron searches for a book, he is asked if he wants to 
charge out the book. The program asks for the patron's ID number and 
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then automatically charges the book out. An explanation is given on what 
happens next and whom to contact if there is a probhm. Other LCS si tes do 
not allow patron charging of local books and therefore have a different 
version of the interface. 

William Potter analyzed the effect our personal computers have had 
on borrowing materials from other LCS schools. Before the personal 
computers, interlibiary borrowing using LCS was 2.8 percent of UIUC's 
total circulation on LCS. After the introduction of *^rsonal computers, 
this figure jumps to over 8 percent. The increase in a^^solute numbers was 
from 35,182 items borrowed in 1982 by UIUC from other LCS schools to 
123,123 in 1985.® This represents a 350 percent increase. By the interface 
asking patrons if they want to borrow from another library, resource 
sharing increased significantly. Until the personal computers were intro- 
duced, patrons did not realize that they had access to over 15 million 
volumes around the state. This same concept could apply when the State- 
wide ILLINET Catalog becomes operational. 

Another creative use of our interface occurs at a local public librarv 
version of the interface, available on a personal computer at the Urbaua 
Free Library, automatically dials into our system. These public library 
patrons get to search our system to locate what they cannot find locally. 
Since our system searches by keyword and subjects, they also have more 
access points than the CLSI terminal located nearby. This sam^, concept is 
applicable to people with home computer^ and modems. 

increased Computing Power and Decreased CosU per Megabvte 

When we purrhased our microcomputers in 1983, they cost us approx- 
imately $1700 each for 128K and no disk drives. Today, a personal compu- 
ter with two disk drives and a ten-megabyte hard disk costs around $1600. 
Although this cost is somewhat higher than a dumb terminal, the benefits 
in computing power, speed, and potential to access other systems far 
outweigh this difference. The era of the twenty- or thirty-megabyte hard 
drives is rapidly giving way to drives with gigabyte storage. The micro- 
computer of today is equal in power to and '-^wer in price than many 
minicomputers of a few years ago. 

We are at present investigating the feasibility of putting the statewide 
ILLINET Online Catalog on compact disc-read-only me.nory (CD-ROM) 
using the LePac system developed by Brodart.^ Using c impact disc for 
II LINET cou'd extend access to this val uable resource to even the smallest 
libraries. Compact disc technology would not require telecommunications 
hookups or charges, and expanding the network would not create the need 
for additional mainframe computer facilities. With the era of ''write often, 
read many times*' CD-ROM around the corner, many online systems could 
fit into d micro having the capability of storing 4 million MARC records. 
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Hardwired Microcomputers Can Search Multiple Databases and Catalogs 
All of the eighteen regional system libraries in Illinois have access to 
our online catalog. They use the online catalog for interlibrary loan and 
bibliographic verification. Most of them also have their own online cata- 
logs or circulation systems. Having to remember commands for their own 
systems and then our difficult comm?»nd structure caused many problems. 

Through an Illinois State Li.ji^xy grant, Cheng set up microcompu- 
ters in three differed system libraries. These personal computers search 
our online catalog using his interface. Then, by pressing a function key, 
they instantly switch to their Data Phase, CLSI, or DRA local circulation 
systems to search using the command language. Two communications 
cards and changes in the interface make this switching between systems 
located on two different types of computers possible. They use the keyword 
and subject searching capability of our system and then search for the 
books on their systems. This occurs uiing the same microcomputer. With 
additional programming even this process could be automatic and 
searches could be saved and executed from system to system. 

Microcomputers as Information Gatekeepers 

A microcomputer with an 100-megabyte hard disk or gigabyte CD- 
ROM drive can store and search local or vendor-supplied databases. It is 
possible to purchase portions of databases from BRS, mount them on a 
mainframe, and search using a microcomputer. This same microcomputer 
could store the results of that se::rch. This result could then be run against 
an online catalog to see if the local library owns the journals or reports. 

We recently purchased InfoTrac for our Undergraduate Library. Info- 
Trac searches for magazine artic^s using an optical disc and a personal 
computer. At present, students must leave the InfoTrac terminal and 
proceed to a different terminal to search the online catalog. It v/ould be far 
more beneficial if this same microcomputer could switch an(' search our 
online catalog. A patron would instantly know if we had the journal and 
its call number. We have already tied two different online systems together 
in the system libraries. This sami microtechnology could also do this with 
InfoTrac and an online catalog. Having to search systems using different 
terminals would become obsolete. 

An atten>5jt at an end user searching system connected to the online 
catalog is bdng^jdeveloped by our engineering librarian. Bill Mischo. 
Figure 6 is a protot^ oe screen for a microcomputer-baseJ system for librar- 
ians toautomatically^v^ial up and log on many systems.^^Bill is adapting 
this prototype to allow students to automatically dial up some databases 
on BRS. After searching BRS, the results are run against our online catalog 
to determine if we own the items. All this will take place using an IBM-AT 
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personal computer. This one termi"- ' ill be an information gatekeepei 
and will allow access to books and i«. various periodical indexes. 



DATABASE VENDORS OR NETWORKS: 

0. LCS/FBR 

1. BRS— TELENET 

2. BRS— TYMNET 

3. DIALOG— TELENET 

4. DIALOG— TYMNET 

5. OCLC— TELENET 

6. OCLC— TYMNET 

7. RLIN— TYMNET 

8. SDC— TELENET 

9. SDC— TYMNET 

10. RESUME SEARCHING- ALREADY ONLINE 

11. PRINT PREVIOUSLY DOWNLOADED DATA 

12. SIMULATION OF ONLINE SEARCH 

13. LOGON BY SEARCHER 

14. LCS VIA SWITCH 

15. REFERENCE INFORMATION 

16. KNOWLEDGE INDEX-TELENET 

17. KNOWLEDGE INDEX— TYMNET 

CHOOSE ONE OF THE NUMBERS 
Figure 6. Illinois Search Aid for Expediting Online Database Searching 



Although I painted a rosy picture of microcomputeis and how they 
help make our system user friendly, there is one problem for diri-access 
patrons. Unless the dial-up user has an IBM personal computer, he or she 
will have to use commands to search our system. Since we support nine 
dial-up ports and two ports connected to a coaxial cable on campus, this 
could be a problem. However, the trade-off for having; a user friendly 
system in the library v. the old system made the microcomputer our answer. 

Conclusion 

I began this discussion with a quote by McLuhan who said "If it works 
it's obsolete." That might be true of some technology but not completely 
true when discussing microcomputers. Any given machine, like the eight- 
bit machine, might become obsolete. However, this indicates that care be 
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taken in choosing a machine that is state of t>e art. Another important 
consideration is that the microcomputer is expandable to take advantage of 
any future technological changes. A recent advertisement for the Compaq 
portable sums up the revolution taking place in microcomputers. It said: 
"Introducing the new Compaq Portable II — 30% smaller, 17% lighter, and 
400% faster!"" 

Finagle's statement ''The information you need is not available" is 
also rapidly becoming obsolete. It is possible that the information is 
available in one of over 3010 publicly available databases or within one of 
the 1.7 billion online records. The problem that remains is how to make 
the public aware of this fact and how to <?llow them easy access. 

Our experience with microcompuuxv has led me to the realization 
that they are the tool to unlock these bulging information storehouses. Just 
as our interface made resource sharing easy and increased interlibrary 
borrowing, so too could microcomputers act as information gatekeepers. 
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Natural Language User Interfaces 
in Information Retrieval 

Introduction 

This paper examines the role of natural language (NL) processing in 
information retrie' il in the context of large operational information 
retrieval .ystems and services. State-of-the-art information retrieval sys- 
tems combine the functional capabilities of the conventional inverted 
file— Boolean K'gic term adjacency approach— commonly employed by 
commercial sean h services, with statistical-combinatorial techniques pio- 
neered in experimental information retrieval (IR) research, and formal 
natural language processing methods and tools borrowed from artificial 
intelligence (AI) iTie emergence and ever increasing importance of end- 
user searching provides challenging opportunities for the integration of 
sophisticated natural language analysis and processing techniques in user 
friendly interfaces. 

IR systems achieve remarkable search speed and flexibility despite the 
virtual absence of formal language analysis procedures and menning inter- 
pretation of the underlying text content.^ State-of-the-art IR systems prag- 
matically blend the best features of diverse probabilistic-combinatorial 
and Boolean logic retrieval models and readily support free-form natural 
language user interfaces.^ Yet, such direct natural language interfaces need 
to incorporate more sophisticated natural language processing and other 
Applied artificial intelligence techniques in order to cope intelligently 
with the inherent ambiguities of natural language queries and text, and to 
compensate for the inevitable semantic loss and confounding inben «. in 
indexing and query matching.^ 
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By the same token, AI approaches to natural language processing, 
particularly as applied to KL user interfaces and text searching, benefit 
from proven IR concepts and techniques in order to transcend still- 
prevalent domain-specificity and performance problems.^ Thus natural 
language databases and user friendly online searching (UFOS) represent 
challenging and mutually supportive common problem areas for both IR 
and AI. 

bifonnation Retrieval 

Although a variety of access methods have been developed for text 
retrieval,^operational IR systems art, almost without exception, character- 
ized by the dominance of the inverted file. Boolean logic search paradigm.^ 
In their basic form, IR systems are designed to manipulate fundamentally 
simple natural language text structures— such as bibliographic citations 
or full- text documents— although some IR software packages have been 
enhanced to incorpoiate generalized database management system 
(DBMS) access methods and special processing function^ needed for the 
handling of integrated textual, numeric, graphics, and image data. The 
automatic "indexing rules" or algorithms are, by and .«rge, lexically based 
procedures guided by delimiters and/or lists of nonindexed ''stopwords/' 
and the resulting inverted search keys are typically organized into B-tree 
structures for acceptable trade-off between speed of access and ease of 
updating. This approach is, for practical purposes, highly flexible and 
domain independent, although distinct indexing rules are needed in order 
to usefully fragment special textual fields, for instance, chemical names. In 
addition, in most IR systems, the autcmai. ally generated keywords are 
frequently augmented with human-assigned subject headings or thesaurus 
entries, mosdy noun phrases, for added syntactic and semantic precision in 
searching. 

Search queries, regardless of their form of input, must be ultimately 
transformed (typically, by trained intermediary searchers) tf^ conform to 
the basic indexing scheme of the given IR system. Quei/ search keys are 
matched against the inverted file searjh keys, and the corresponding 
ordered inverted lists are rompared usirg exact-match Boolean AND, OR, 
AND NOT logic operators implemented as set operations (intersection, 
union, and complement, respectively) on the inverted lists. The process is 
exceedingly fast and efficient as implemented via currendy available hard- 
ware and software architectures. The lack of linguistic and cognitive 
analysis procedures at indexing time and the resulting precision/recU 
problems are, to some extent, alleviated by the availability of powerful 
patten.-matching functions and metric (hierarchical/positional) opera- 
tors, such as character masking, truncation, and adjacency . Trainedinter- 
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mediary searchers, in turn, provide the needed augmented ^'knowledge 
base," "inference engine," and '^control strategy" for the proper configura- 
tion and sequencing of retrieval operations. 

In the last few years, operational IR systems have been implemented 
that reflect the influence of G. Salton's seminal work and incorporate 
many useful ideas and results from theoretical and experimental IR 
research going back to the mid-1960s7 

IR research on the whole has been dominated by nonlinguistic, pri- 
marily lexical -statistical NLP methods as exemplified by Salton's SMART 
system® and the work of researchers such as Bookstein, Kraft, Rijsber^^en,^ 
ard many others. Althougl. experimental IR research prototypes have 
often suffered from problems of sca^e limitations in their technical 
approaches, IR research has nonetheless contributed a coherent concep- 
tual framework and identified a core of desirable IR functions and evalua- 
tion methods that are of particular relevance to the design and 
implementation of end-user oriented information retrieval systems and 
services. The most important among these are the notions of ( 1 ) unrestrict- 
ed natural language query input, (2) close?*-match search strategy, (3) the 
ranking of the retrieval output according to expected relevance to tue 
query, and ,'4) the dynamic utilization of user feedback in automatic query 
refcmulation and search strategy modification. 

Large-scale implementations of NL IR interfaces, such as CITE,^° 
efficiently combine the best features of the Boolean and the 
probabilistic/combiiiatofial retrieval models with limited ''intelligent" 
computational linguistic analysis and Al-type search heuristics. Such 
end-user oriented systems treat unrestricted natural language both in 
queries and text records as the least common denominator among different 
searchers, databases, and IR systems, offering the potential of true trans- 
portability and transparency among diverse users, information sources, 
and search systems. In order to successfully emulate the trained searcher's 
augmented retiieval "knowledge base," ''inference engine," an J "control 
strategy," however, UFOS must possess intelligent NLP cap :ibilities of 
greater sophistication.^^ 

The convergence of a number of hardware, software, and user trends 
necessitate the augmentation of conventional information retrieval and 
filtering capabilities with appropriate and efficiently implemented tech- 
niques adapted from AI application areas— such as natural language 
processing and understanding, expert systems, intelligent information 
management, and intelligent problem-solving The trends include: full- 
text databases; very large databases; mixed information sources containing 
text, numerics, graphics, image, and other data; the increased diversity, 
depth, and breadth o'' information sources; hybrid technologies (e.g,, 
compact disc-read-on^ ' memory, CD-ROM); special-purpose IR hard- 




NATURAL LANGUAGE USER INTERFACES 



83 



ware; non-von-Neumann comr^utcr architectures; associative memories; 
distributed and parallel proceu. ig; and interactive end-user searching and 
personal, computer-enhanced ''information metabolism." 

Artificial Intelligence 

Artificial intelligence, particularly applied artificial intelligence, is a 
growth industry of high expectations.^^ Notwithstanding the difficulty of 
defining intelligence, let alone artificial intelligence, and despite the many 
lingering doubts and the seeming tower of Babel situation, the AI field 
possesses a healthy basic research underpinning and the AI community has 
been successful in developing important concepts, tools, techniques, and 
applications of interest to other information-intensive disciplines.^^ Just 
as "conventional" programming languages and diverse 'ata representa- 
tion ntodels in "classical" IR and DBMS systems serve as problem-solv ing 
tools for a wide variety of applications, AI knowledge representation 
models, such as production rules," frames,^^ and semantic networks as 
well as AI programming languages, such as LISP (list processing), 
PROLOG (logic programming),^ 0PS5 (rule-based programniing)/'' 
SMALLTALK (object-oriented programming), and other general purpose 
techniques and tools (e.g., heuristic search, ATN [automatic translation 
network] parsers, and rapid prototyping,^®) repusent versatile AI in.ple- 
ments for a broad range of applications including IR. 

Natural Language Processing — Problems 

NLP and computational approaches to dealing with natural Ian- 
gjage were among the earliest objects of interest in AI research. The 
well-known ambitions and subsequent failures of machine translation 
research and development projects in the 1960s, as well as the recent3uccess 
of more limited yet highly pragmatic computer-assisted language transla- 
tion systems, epitomize both the great difficulties involved in coping with 
natural language and the degree of maturation and realistic goal setting of 
the field. 

Understanding language, though it appears to humans to be natu- 
rally easy, is a difficult task that involves highly cognitive and not yet 
totally understood intellectual and psychological processes.^^ Schank 
argues that NLP is both a linguistic and a cognitive '-^sk, and it cannot 
occur without a knowledge base concerning th^ relevant subject area.^ 
This in turn points to a synthesis of natural language processing and 
t. ^ ert systems techniques. Typical areas of concern and investig ition in 
AI NLP research involve automated lexical, syntactic,andsemantir,a-ialy- 
sls; dealing with ill-formed or fragmentary input* 'ellipsis; conjunction 
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and negation; pronoun/anaphora resolution; definite noun phrases, 
quantification; beliefs and intentions; fail-soft recovery; space and time 
and contextual understanding— to name just a few. 

The role of NLP in information retrieval is less clear^^ given that the 
very nature of the textual documentj* traditionally dealt with in IRdoes not 
warrant elaborate anal ysis, and the fundamentally mechanistic techniques 
developed in IR have served to handle the passive query-to-documenl 
Hiatching problem at an aco ptable level of success. The advent of unme- 
diated "user friendly" seaich interfaces on the one hand, and the consider- 
ably broader scope and depth of today's full-text databases on the other, 
however, necessitate a careful reexamination of this role. 

Computational linguistic processing has reached fairly reliable stabil- 
ity and practical utility in moiphological and syntactic analysis. While 
procedures for morphological analysis are decidedly nontrivial, they are 
generally more straightforward and efficient than syntacticanalysis. Com- 
mon strategies for morphological analysis employ some or all of the 
following: morphological rewriting rules, dictionary lookup, inflection 
generator, complexity testing, and idiom and compound recognition.^ 
Prefixing and suffixing arc the main concern in processing English. Often, 
in processing specialty languages— e.g., medical English—special atten- 
tion must be paid to characteristic prefixing, suffixing, and morphose- 
mantic problems.^ Compare, for instance: 

ALEXIA— DYSLEXIA 
VITAMINS— AVITAMINOSIS 
ANXIETY— ANTIANXIETY AGENTS 
INFECT— DISINFECT 
-^iYPERTENSION-HYPOTENSION 
ADJUSTMENT-MALADJUSTMENT 
ANTIBIOTICS— "-MYCINS" 
INFLAMMATION— "-ITIS" 

Reasonably— i.e. relatively — robust and efficient NL syntactic parsers 
have been developed and incorporated into many artificial intelligence 
applications. "Parsing efficiency is ciucial when building practical natu- 
ral language systems. This is especially the case for inten ziive systems, 
such as natural language database access, interfaces to expert systems and 
interactive machine translation."^^ Tomita's LR context-free parsing 
algorithm, for instance, takes advantage of the left-to-rightness of natural- 
language user i ^ut, and parsing ctarts as soon as the user types the first 
word, thus reduL ^ apparent response time from the user's point of view. 

Despite significant advances, a number of unsolved problems and 
limitations in AI NLP remain. Most importantly, none of the systems 
developed to date are fluent in the use of unrestricted natural language.^ 
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For potential IR use of AI NLP techniques, it is important to remember 
that while most full sentences?ire unambiguous, their component parts are 
frequently ambiguous. Since IR systems utilize component or fraynrientary 
lexical, syntactic, and semantic units as a result of indexing, and since 
indexing inevitably implies omission, the disambiguation i>roble* ^ yn 
domain-independent IR systems is much less tractable than in narrow 
domain artificial intelligence applicaaons. 

Acoustic-phonetic, lexical, syntactic, and pragm? tic language com 
plexity, as reflected in word-sense ambiguity, structuial ambiguity, and 
the referential ambiguity of noun phrases, pose very challenging problems 
in user friendly artificial intelligence and information retrieval interfaces, 
"^-^w if any existing NLP systems, for instance, are able to recognize and 
uiSambiguate compounds (compare database v, data base), acronyms 
(compare G-SUIT), or abbreviations (compare MD ==> Maryland ==> 
physician) in large textual databases. 

Noun-phrase ambiguity may be further compounded in keyword 
indexing by ignoring word order, field, and other text boundaries, and by 
partial match search strategy in query matching, e.g.,: 

MEDICAL LITERATURE =•■-=> MEDICINE IN LITERATURE ==> 
LITERATURE IN MEDICINE 

SEXTJAL PERVERSION ==> SEXUAL ABSTINENCE 
There is a lot of h umor in all of this, as in.* 

HUMOR IN MEDICINE ==> AQUAEOUS HUMOR IN MEDICINE 

or, put differently, language understanding can be a joke. For proper 
perspective it is worth keeping in mind that there still does not exist any 
NLP computer program that can handle nearly all of English syntax. 
None can even come close to coping with the semantics of all of the English 
language and none is on the horizon. "Wretched and confusing prose can 
defeat even human comprehension.''^ 

The imp^' ~'bly large num.'>er of rules tnat would be necessary for the 
morphological, syntactic, and semantic disambiguation of natural lan- 
guage in real-life multidisciplinary knowledge domains precludes using 
the rule-based expert system approach as well. (No accurate estimation has 
been made of how many context-free rules are needed to cover English 
almost completely^ but the number is very large.)^ From the survey of the 
literature it appears that artificial intelligence is not quite ready to field 
any system flexible enough for mass use, save a fev/ relatively small prob- 
lem domains.^^ 

Can information retrieval facilitate finding artificial intelligence 
solutions to practical natural language processing problems? Certainly so! 
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Domain-independent IR search techniques can serve as efficient filters for 
more refined in-depth natural language processing. As more and more 
powerful AI concepts and tools become available (compare AI PC toolkits) 
to more and more end users, the problems of scale and performance 
limitations that characterize conteniporary AI systems will be gradually 
overcome. To the extent that the AI and IR communities will be able to 
learn from each other and gain mutual insights into the problems of 
language and searching, there will exist intelligent IR systems and effec- 
tive domain-independent AI NL search systems. And perhaps— in theory 
at least— the distinctions between the two (mind) sets will be "fuzzy" at 
best.^ 

Natural Language and Usei Friendly OnLne Searching 

Natural language interface technology represents a major break- 
through in "user friendly'* computer systems.^^ Along with other end-user- 
oriented interface techniques (such as menus, windowing, graphics, icons, 
pointing, touching) commercial implementations of NL interfaces (such 
as Artificial Intelligence Corporation's INTELLECT or Texas Instru- 
ment's NLMenu DBM^ front-end products) target the largest hitherto 
untapped segment of the information marketplace, namely end users. The 
same holds true for public access online catalogs in libraries and for user 
friendly IR interfaces in general,^^and for natural language information 
retrieval interfaces in particular (e.g., the National Library oi Medicine's 
CITE system for searching the world's largest medical literature databases, 
MEDLINE and CATLINE).^^ 

Online systems and the personal computer revolution have made 
computer resources universally available. Now a similar revolution in 
software (i.e., user interfaces) is needed to make the computer universally 
usable.^^ Natural language interfaces in information retrieval and artifi- 
cial intelligence are the scouts, shock troops, vanguards, and sometimes 
martyrs of the user interface revolution. 

With occasional end users already outnumbering t«ained professional 
searchers in the user populations of online information utilities like The 
SOURCE and CompuServe, it becomes increasingly imF>ortant to develop 
and refine analytical cognitive models to better assess the user's skills and 
understanding of the infor*^ation stored, and to match the user's cognitive 
model to the system's mouei aud knowledge representation. 

Of course natural language is not always natural for a user interface,^ 
but it is particularly well suited for IRand DBMS interfaces due to the very 
large number of potential users, high volume of query transactions, distri- 
bution of costs over large numbers of users, and the fundamentally lingu:s- 
lic nature of the user-system information exchangc.^^ 
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Ease of learning, ease of use, and transportability are among the most 
attractive features of natural language front-ends. Nraural language shifts 
the burden of understanding from the user to the system thus allowing the 
user to focus on the problem at hand.^' At the current state-of-the-ar? , the 
high development cost and less than 100 percent reliability of knowledge- 
based domain-specific NL DBMS «vstems (compare EXPLORER, deve- 
loped by Cognitive Systems, Inc. for oil exploration)appearto have palled 
their widespread development and commercial use. Considerably more 
commercial success has been achieved by domain-independent NL DBMS 
front-ends. INTELLECT, forcxample, substitutes knowledge of the phys- 
ical and logical structure of the database and the topology of user interac- 
tions and system functions for domain-specific semantic knowledge, using 
the inverted DBMS index as lexical pointers to the system's rather small 
domain-specific and semantically augmented dictionary. Despite its short- 
comings in ambiguity resolution,^ INTELLECT succeeds in its practical 
utility, reliability, and operational performance. 

It is interesting to note that while a DBMS NL front-end like INTEL- 
LECT has to cope with the full spectrum of linguistic processing problems 
of a "habitable subset" of the English language— namely the l?^'^ed 
domain of tlie DBMS command language and a relatively small number of 
query interaction paradigms— it does not have to deal with language 
ambiguity and matching problems at the level of the database content, due 
to the fact that DBMS systems deal, for the most part, with discrete and 
finite nontextual data. 

By contrast, a natural language information retrieval textual database 
interface, like NLM's CITE system^^ must focus on language ambiguity 
and matching problems for all of English. At the same time, CITE need not 
invest a great deal of effort in full-scale linguistic analysis in query-io- 
command language translation due to the relatively small number of IR 
commands available and the simplicity of the underlying database struc- 
ture. (It would be perfecdy feasible and appropriate to use INTELLECT- 
like NLP techniques, instead of menu choices, in CITE to ''understand'* 
the type of query at hand, identify its topical component, as well as any 
implied or explicitly stated liri?iiations as to type of material desired, 
language restrictions, currency of ms erial.) 

Natural Language Queries in NL Databases 

Natural language information retrieval interfaces must then deal with 
the problem oi language ambiguity at the level of both the query and the 
database content and mu?t resolve the matching problem between free- 
form queries and the database in a manner acceptable to end users who 
typically approach a natural language system with high expectations of 
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(arlificin^ machine) inielligence. Tlie basic problem areas of matching can 
be conveniently divided into lexicil, syntactic, semantic, and special 
concerns. 

Lexical Problems 

Tlie inverted keyword index of a NL database of 1 million records is 
likely to contain in excess of a quarter million distinct lexical words,^°The 
frequency distribution of these lexical entries will typically conform lo the 
characteristics of the Zipf distribution. Tliis empirical fact has the follow- 
ing practical implications: 

1. There will be a few dozen to a few hundred highly ix>sted (i.e., high 
database frequency) index entries, many of which will be natural candi- 
dates for exclusion from the index (compare ''stopwords"). 

2. Approximately half of all the index entries will have a database fre- 
quency of one, and as many as half or more of these may be misspell- 
ings. Tliis suggests the incorpory^tion of efficient spelling error 
detenion/correction algorithms and dictionaries in the NL user inter- 
face and in indexing, with the dictionary preferably derived from and 
dynamically updated in conjunction with the database itself. 

3. Knowledge of the frequency distribution of the leximl entries suggests 
implicit heuristics for automatic search strategy fonaulaiion. Tlie NL 
interface must be capable of recognizing and properly dealing with 
frequent acronyms, abbreviations, numerals, chemical names, names of 
people/syndromes: 

X RAY ==> X-RAY; CAT SCAN; US ==> USA ==> U.S. ==> U.S. A ; 
AI ==> ARTIFICIAL INTELLIGENCE; VITAMIN B 1 ==> 
VITAMIN Bl ==> THIAMIN; TYPE . ==> TYPE I; FACTOR V- 
2,4,5-T ==> TRICHLOROPHENOXYACETIC ACID ==> AGENT 
ORANGE; EPSTEIN-BARR VIRUS; BARR BODIES; BARRE-LIEOU 
SYNDROME; GUILLAN-BARRE SYNDROME 

orthographic and phonetic transcribing and transliteration: 

TUMOUR ==> TUMOR; 

GYNAHCOLOGY =--^> GYNEKOLOGIA ==> GYNECOLOGY 
idioms and cliches: 

OFF COLOR; CHANGE OF HEART; OUT OF SIGHT 
slang, lingo, jargon, and lore: 

POT; SPEED; ANGEL DUST; CRACK 
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stopwords that are noun-phrase components: 

VITAMIN A; HEPATITIS A; "TO BE OR NOT TO BE"; 
THE "ME" GENERATION 

compound words: 

DATABASE ==> DATA BASE; ONLINE ==> ON-LINE ==> 
ON LINE; BACKACHE ==> BACK ACHE ==> BACK PAIN 



Morphological Analysis 

Morphological analysis (stemming or "conflation") warrants special 
attention in large-scale natural language information retrieval interfaces. 
The automatic identification of lexical roots in invertec" file based opera- 
tional IR systems is the first step in the process of matching query words 
and inverted index entries. The latter may have been derived from auxiliary 
vocabularies that serve as semantic database navigational tools, or from the 
database records themselves. The following. r3:amples from the MEDLINE 
and MEDICAL SUBJECT HEADINGS inverted indexes illustrate ever- 
present and familiar lexical ambiguities and semantic noise introduced as 
a result of using word roots with variable-length masking openitions in 
matching: 

ACCESS —•> ACCESSORY 

ASPIRIN ==> ASPIR.\TION 

AUDIT ==> AUDITORY 

BATTERED ==> BATTERY 

COM.MUNICABLE ==> COMMUNICATIONS 

CREATINE ==> CREATIVENESS 

DIGITAL ==> DIGITALIS 

EXTREME ==> EXTREMITIES 

EXPECTATION ==> EXPECTORANT 

INFANT ==> INFANTILISM 

INFORM ==> INFORMAL ==> INFORMER 

LABOR ==> LABORATORY 

MEDIA ==> MEDIAN ==> MEDIATED 

METHOD ==> METHODIST 

MIGRAINE ==> MIGRANT 

NURSERY ==> NURSES ==> NURSING 

RECEPTION ==> RECEPTORS 

SHORT ==> SHORTAGE ==> SHORTHAND 

TREAT ==> TREATMENT ==> TREATISE ==> TREATY 
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Short words require even more caution: 

AID ==> AIDS 
ANAL ==> ANALYSIS 
APE ==> APES ==> APEX 
ARM ==> ARMY 
CARE ==> CAREER 
FAIR ==> FAIRS 

HEAR ==> HEARING ==> HEART ==> HEARTWATER ==> 
HEARTWORM 

Many thousands of other such examples could be found. 

Syntax-Related Problems 

Natural language topical surrogates— e.g. book and journal titles, 
headlines, table of contents, back-of-the-book indexes— are usually 
expressed via larger syntactic units, mostly noun phrases. Noun phrases 
are frequently ambiguous in and of themselves— e.g., SHELLFISH POI- 
SONING ==> POISONING OF SHELLFISH ==> POISONING BY 
SHELLFISH. The use of Boolean operators, metric operators, and m out 
of n weighted-logic closest-match search strategy— in order to compensate 
for the lack of linguistic analysis in indexing— further compounds the 
text-matching problem. Consider, for example: 

ABUSE ==> CHILD ABUSE ==> DRUG ABUSE ==> 
ELDER ABUSE ==> SPOUSE ABUSE 

CRISIS MANAGEMENT ==> MANAGEMENT CRISIS ==> 
MANAGEMENT BY CRISIS ==> CRISIS BY MANAGEMENT 

KIDNEY ==> IvIDNEY BEAN LECTINS ==> KIDNEY DISEASES 

SEXUAL ABSTINENCE ==> SEXUAL PERVERSION 

SHORT TERM EFFECTS ==•> SHORT TERM MEMORY ==> 
SHORT TERM PSYCHOTHERAPY 

Since literally hundreds of similar lexical and syntactic matching prob- 
lems are encountered daily in a large operational NL IR system, it is 
evident that automatic query analysis and matching can substantially 
benefit from morphological and syntactic analysis in ord.;r to lend addi- 
tional precision to the available truncation, character masking. Boolean, 
metric, weighted-logic, or generalized pattern-matching strategies. Con- 
sider for instance the automatic generation of Boolean search statements 
from NL queries.'" 
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Semantic Problems 

A great many formal semantic aids and ad hoc heuristics are used by 
trained searchers when interacting with information retrieval systems. 
Some examples are controlled vocabularies, "hedges," "preexplodes," 
multidatabase cross indexes, stored search strategies, and the like. Systems 
that rely on controlled vocabularies often lack in currency, database war- 
rant, or conceptual exhaustivity. For instance. Medical Subject Headings 
does not currently (1986 edition) have a subject heading for BIOTECH- 
NOLOGY, and it uses PMS as a cross reference to an older subject heading 
PREGNANT MARE SERUM (GONADOTROPINS, EQUINE), but at 
the same time PMS is not linked to PREMENSTRUAL SYNDROME, nor 
is AI linked to ARTIFICIAL INTELLIGENCE. 

As was noted earlier, systems without automated semantic aids shift 
the full burden of query understanding and matching on the searcher. NL 
IR interfaces must minimally rely on and must intelligently utilize exist- 
ing machine-readable semantic search aids. The existing aids,, however, 
need to be augmented by additional semantic mapping tools such as 
statistical term associations, switching vocabularies, enriched "fuzzy" the- 
sauri, "scriptal" micro-lexicons, production rules, and/or heuristics eli- 
cited from expert searchers. 

The natural language information retrieval interface must also be 
designed to deal with special problems such as multiple languages, spe- 
cialty languages, dialects, and professional jargon."*^ To a considerable 
extent, the CITE experimental R&D system and its operational versions 
have attempted to address many of the linguistic problem areas oudined in 
this paper.^^ 

CITE represents a domain-independent NL IR interface approach 
that combines conventional inverted file. Boolean logic with term fre- 
quency-based weighted logic, closest-match search strategy and efficient 
NLP techniques involving "intelligent" stemming,^^ partial syntax analy- 
sis, automatic query-to-controlled-vocabulary mapping, look ahead 
search ambiguity resolution and filtering of combinatorial controlled 
vocabulary term displays, automatic user feedback processing, and other 
techniques adapted from applied AI such as domain-specific semantic 
navigational tools, refined textual pattern matching, and ad hoc expert 
searcher heuristics. The appendix illustrates several NL user interactions 
on the CITE system. The last example serves to put things in humbling 
perspective: The search query NATURAL LANGUAGE PROCESSING 
automatically picks up the subject heading NATURAL DISASTERSI 
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Other AI Applications of Direct Relevance to IR 

In addition to natural language processing techniques in general and 
NL DBMS front-ends in particular, the following artificial intelligence 
aieas are perceived by this author to be of direct relevance to IR: 

L Expert systems. To the extent that rule-based expert systems are instan- 
ces of very high level programming tools that allow the expression of 
order-independent rules instead of ad hoc pieces of order-dependent 
conventional program code, they can be of benefit in NL IR interface 
development in capturing trained searcher expertise as well as codify- 
ing broad linguistic processing rules. Efficient microcomputer imple- 
mentations of rule-based expert systems are becoming increasingly 
available/^ 

2. Intelligent infor.ration management. Intelligent information manage- 
ment involves tiie analysis of the interrelationships among multiple 
databases, information sources, user behavior, including observation of 
past actions and codified procedures, in order to develop rules for 
enhanced data retrieval and management.'*® Elements of this approach 
have been utilized in information retrieval— e.g., in the PAPERCHASE 
system^'^and the Syracuse University SUPARS Project. The systematic 
development and utilization of intelligent information management 
techniques should benefir IR systems in the future. 

3. AI knowledge ^epreser? :ation techniques. In general, AI researchers 
have found that amasbing large amounts of knowledge rather than 
sophisticated reasoning techniques a'le responsible for the power of 
expert systems.^* The knowledge encoded in conventional control kd 
vocabularies can be potentially augitiented via rule-based relationships, 
"ISA" knowledge-representation cbnstructs,^^ or predicate calculus 
statements and fuzzy logic. 

4. Integrating IR, DBMS, AI, and other technologies. To date, relatively 
little work has been done in such integrative R&D. Videotex and 
CD-ROM systems are perhaps the most promising applications in this 
category. The latest videotex systems— e.g. The SOURCE and 
CompuServe— combine sophisticated IR, DBMS, electronic mail, and 
other technologies, and typically offer full-text inversion and Boolean 
search, distributed database management as well as menu-driven, tree- 
structured, user friendly access. CD-ROM database publishing and 
special -purpose knowledge-base applications similarly combine state- 
of-the-art IR, DBR^S, AI, and overall information-management tech- 
nologies. The integration of efficient NLP techniques and intelligent 
computer-assisted instruction^^ capabilities will enable videotex and 
CD-ROM users, and users of diverse information utilities to augment 
their own intellectual power by machine intelligence that is perhaps 
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going to be able to grasp— if not understand— database information 
content, discover new relationships, synthesize new knowledge, and 
postulate new hypotheses. 
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Design Issues in Automatic Translation for 
Online Information Retrieval Systems 



Introduction 

One objective of computer intennediary systems is to minimize incidental 
and accidental differences among the manv distinct languages found in 
online bibliographic retrieval. Three classes of languages are identified: 
access protocols, reuieval commands/responses, and database structures. 
Each class has its own characteristic requirements for automatic transla- 
tion. In developing one intermediary product— the Sci-Mate Searcher — 
distinct translation approaches proved most effective for each class: a 
procedural language for access protocols, customized coding for retrieval 
commands/responses, and a knowledge-based table for database struc- 
tures. Despite differences in translation methods, users are presented with a 
consistent view throughout the product. 

The Problem: Online Babel 

Online bibliographic information retrieval, from a systems point of 
view, is not user friendly. Cing many heterogeneous online bibliographic 
services can be difficult for professional searchers and nearly impossible for 
occasional end users for several reasons. These include the problems of 
database selection, strategy development, and the overwhelming and 
sometimes contradictory details of usage and syntax. This paper addresses 
this last source of difficulty, one that is more likely to be solved in the near 
future by automation than the semantic and subject-knowledge issues. 

Online services provide access to enormous amounts of information 
but at the same time pose linguistic barriers to their own broad usage. The 
number of services with distinct protocols and languages continues to 
grow. There are now five major packet switching networks in the United 
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Slates and one in every European country. At least fifteen bibliographic 
database hosts in the United States, Canada, and Europe offer hundreds of 
databases with many of the specialized databases found only on a single 
host. Besides containing unique information, each database is structured 
with distinct field designations and data coordination conventions. 

The linguistic conventions used in online searching are by design 
terse and cryptic. With cost a function of time spent online, brevity is 
mandatory. For searching to be cost effective at even the relatively fast 
transfer rate of 2400 baud, a minimal user environment is preferable. 
However, this does not excuse the great diversity and incompatibility of 
comm?nds, codes, and conventions. 

From one system to the next, a given function usually is invoked by a 
keyword entirely unique to the system. With a few exceptions, there is no 
opportunity to define synonyms or otherwise improve consistency among 
the systems used. The babel of distinct protocols and language conventions 
now being used by online systems derives from the history of their 
development. 

The packet switching networks- -Telenet, Tymnet, Uninet, and 
Infonet — were developed independently from one another as competitive 
services. Each interacts with users using their owii distinctive protocols 
and conventions. 

In the early and mid-1970s, development efforts in bibliographic 
search software were independently conducted by several firms, notably 
Dialog (Lorkheed), Orbit (SDC), and Bibliographic Retrieval Service 
(BRS). Dialog evolved out of Recon, funded by the National Aero'iautics 
and Space Administration, while Orbit evolved out of Elhill under contract 
with the National Library of Medicine. BRS Search, originally derived 
from IBM's STAIRS software, has always been a commercial search ser- 
vice. During this independent development, there was no coordination of 
language terminology and syntax. 

Many of the early commercial bibliographic databases derived from 
federal and private publishers of printed tertiary indexes. It is remarkable 
that data from so many diverse sources were brought together and made to 
work under one or more vendors' retrieval software. The data in many cases 
were not initially intended for distribution asanonlinedatabase. It is quite 
understandable that most databases follow their own distinctive indexing 
and fielding conventions. 

At the present time, economic and technical constraints work against 
significant change in the online systems' software. The networks, retrieval 
hosts, and database publishers all have invested thousands of man-years in 
software and data. Customers who have learned to use these systems 
depend on them remaining stable. All involved are understandably reluc- 
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tani to look for and convert to any proposed standard, such as the Euro- 
pean Common Command Language/ 

Given this situation, the information retrieval specialists often must 
make choices that are not fully satisfactory. The specialists may limit 
themselves to one or two host systems and a few select databases. The 
capabilities of the systems and the databases' content and structure can 
thus be thoroughly mastered to accomplish all that is possible with the 
selected facilities. But in so restricting themselves, vital information on 
hosts and databases not used will be missed. 

On the other hand, specialists may chi,ose to learn hew to use a broad 
selection of host systems and databases in order to access all possible 
sources of relevant information. Much of what must be learned are details 
of access protocols, commands, and database syntax— the only means to 
getdng the information itself. Learning and maintaining proficiency with 
many different systems is costly and may result in specialists with more 
diffuse and less expert skills than those who choose a more limited scope. 

Technical end users are even more restricted by the linguistic barriers 
posed by conventional database systems. They can afford less time than 
specialists to devoid to learning details of access, retrieval, and database 
structure. ITiey are frequently bewildered by the diversity of options. Most 
of the time, end users will turn to specialists to meet their information 
needs even though they know the technical terminology better and arc 
better equipped to judge the results of the search.^ 

A Natural Solution: Intermediate Translating Computers 

For more than ten years, various automated solutions to the linguisdc 
problem have been proposed and implemented. These usually consist of a 
computer placed between users and one or more host systems. Known 
generally as "computer intermediaries," the >e systems function in part as 
translators that mask incidental and accidental differences between Ian- 
guages in the access and retrieval process. 

Computer intermediaries provide a set of services that usually go 
beyond translation. Frequently a richer and more consistent user environ- 
ment adds customized value to the entire process. Uploading allows locally 
stored and maintained strategies to be sent to the host. Downloading 
allows results retrieved from host systems to be locally saved and processed. 
Assistance is given in selecting databases; online descriptions of the sys- 
tems and databases are available; accounting subsystems are provided; and 
results can sometimes be transformed and factored back into queries. 

Examples of mediating systems involving software running on stand- 
alone dial-up mainframes include the experimental CONIT, at MIT, and 
the former Chemical Substances Information Network (CSIN), funded by 
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the National Library of Medicine. Another switching and mediating ser- 
vice that is dialed up but uses microprocessor hardware is EasyNei.^ 

Examples of software that can run on the user's own microcomputer 
include several packages that are no longer actively marketed* OL'SAM 
from the Franklin Institute, InSearch and ProSearch from the Menlo 
CorjX)ration, and SearchMaster from SDC/ Microcomputer software cur- 
rently available includes the Sci-Mate Searcher (Version 2.0) from the 
Institute for Scientific Information (ISI), Micro-CSIN and the Grateful 
Med from the National Library of Medicine, and Search Works from 
Online Research Systems.^' 

In developing ISI's Sci-Mate Searcher, the use and structure of net- 
work access, retrieval languages, and databases suggested distinct methods 
for automatic translation. The characteristics of use and structure and the 
methods developed to accommodate them will be described in the next four 
sections of this paper. Particular features of the Sci-Mate Searcher will not 
be described. Rather, general principles of automating online language 
translation will be described. 

Two Interfaces: The Ustr and the External System 

When performing translation functions, an intermediate computer 
must manage two distinct language interfaces: one with the user and one 
with the external systems. Recognizing these two interfaces as distinct and 
isolating their distinctive operations in separate modules is essential for 
successful design.^ 

Tlie user interface provides all significant retrieval functions and 
capabilities to the user. Controlled here is what the userc-an request and the 
way it can be requested. Also controlled here is what the user is given from 
the external systCiii and the way it is presented. 

''The external system" refers to everything beyond the serial port of 
the intermediate computer. The external system interface defines what 
modems, networks, and host systems are supported. It also defines the 
functions and capabilities that will be used in interacting with these 
external systems. The commands issued on the serial line are constructed 
in the external system interface modules, and the responses from the 
external systems are first processed by these modules too. 

Intermediate translating software could be designed and written to 
directly convert user input into a form required by the current host and 
directly convert the current host responses into a standard form for the user. 
It is tempting to design and code this way. However, as more entities and 
capabilities are added to the external system, designing and writing yet 
another module for direct translation becomes quickly untenable; the 
mediation software soon becomes an intricate and incomprehensible net- 
work of code; and maintenance becomes impossible. 
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Design and implementation is much more easily managed if interme- 
diate data structures are defined for all transactions. These structurrs store 
inputs from both users and systems in a standard form. The structures arc 
accessed in separate steps for constructing acceptable commands for the 
network and host systems and for presenting consistently formatted infor- 
mation from the host to the user. The data structures serve as a buffer 
between the two interfaces. In the Sci-Mate Searcher, these data structures 
have been called the intermediate language. 

With the intermediate language, user input destined for the host is 
accepted witliout regard to which host is currently online. Only in the 
system interface step that follows are the particulars for the current host 
added to data extracted from the intermediate structure. Similarly, 
responses from the current host are stored in the intermediate language 
data structures. These are then accessed in a separate step as host- 
independent data which are transformed and presented to the user. 

Access Protocols: Description and Requirements 

Access protocols here refers to the process of negotiating modems, 
networks, and host system passwords. Access is often viewed as a rote but 
necessary nuisance. It involves a y,*ries of steps wh^ch depend upon the 
details of particular hardware and systems. Tliesc steps include the 
following: 

1. dial the network node, manually or through modem control; 

2. inform the network about the terminal speed and type; 

3. instruct the network about flow control and line padding; 

4. specify the name or address of the host system; 

5. negotiate the password(s) with the host; and 

6. answer any standard host questions about news, etc. 

Until the early 1980s, modems were usually dialed manually. Intelli- 
gent modems which allow software to control dialing now make it possible 
to fully automate this task. Among intelligent modems, models from D.C. 
Hayes have set the standard for control language. Data networks presently 
used in the United States for online access include Telenet, Tymnet, 
Uninet, Dialnet, and InfoNet. 

The user's primary requirement for automated access is to get con- 
nected to a particular host or database. Usersalall levels of expertise would 
like to log on by simply naming the database. All the steps of the process 
are amenable to complete automation. However, if a failure occurs and 
automatic access cannot be carried out, users should be able to choose 
alternatives such as trying again, quitting the session, or taking over 
manually. 
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A six*clal-purposc procedural language was fell lo l)C appiopriate for 
automating access in (he 3ci*Ma(c Searcher. Access isalgorithmic in (hat i( 
has a clear beginning and end. TransIa(ion by a procedural language is 
effective in access where only a smal 1 number of possible messc^ges from (he 
external system can be anticipated at any given moment in the process. 
These messages, and the lack of any message in a given period cf time, are 
known as potential states in the process. When a possible message is 
received, the state is said to be realized. 

The actions to be performed when a potential state is realized are few 
and readily defined and programmed. Tliesc actions mainly involve a 
message in response. After specifying the host or database, the user 
becomes largely an obser\'er during automated access. Tliis allows the 
intermediate computer to control and respond to other computers, a rela- 
tively straightforward task in the case of acn?ss. 



Retrieval Commands and Responses: 
Description and Requirements 

Tlie retriet^al languages of the bibliogniphic host systems consist of 
commands entered by users and host responses to these commands. In the 
United States rind Canada there are about half a dozen major host systems; 
in Europe there are at least six more. In each case, they accept a command 
consisting of a command verb (sometimes implied) followed by an 
argument. 

''Retrieval command and resfwnse" will refer here to language com- 
ponents that control the host retrieval software but are independent of any 
specific database. Most retrieval systems f^rovide at least the following 
basic commands: 

1. pick a database or set of da'^bases; 

2. browse inverted indexes to the database(s); 

3. select terms and specify term logic; 

4. display records from sets constructed; 

5. request records to be printed and mailed; 

6. review the sets created during the session; and 

7. leave the system. 

Additional capabilities are frequently provided tliat build on these 
basic ones. They include commands to: 

1. make selections from the inverted index display; 

2. search for complete phrases in database records; 

3. limit the search to years, updates, or languages; 

4. specify ranges of records and formats for dispbying and printing 
records; and 
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5. save strategies— recall and use saved strategies. 

The retrieval software systems always allow a series of sets to be 
created. These sets consist of pointers to records which can be directly 
reviewed. The sets of pointers can also be used in the argument of the 
selection command. As terms in logical expressions they result in further 
sets as the search strategy is refined. 

Automated translation of retrieval system commands must provide a 
consistent syntax in place of the broaddiversity of construction required by 
the different systems accessed. This requires at the very least a single set of 
command verbs or function names to be used across systems. It further 
requires a unified, or at least consistent, set of conventions in the construc- 
tion of command arguments. Finally, responses from the host should be 
standardized before being presented to the user. 

Commands for the host systems are constructed from: (1) a standard 
command specified by the user, (2) data elements entered by the user, and 
(3) punctuation and other connecting elements required by the host. All of 
these are ordered as required by the host. Responses received fiom the host 
are parsed into the intermediate language tables from which significant 
data elements are extracted and reconstructed in a consistent form for 
presentation to the user. 

The intermediate system should automatically enter and leave the 
"modes" found on hierarchically organized command systems, such as 
BRSand Questel. All information about sets created and commands issued 
in the current session are saved for the duration of the session. As part of 
response parsing, the intermediate system recognizes error messages and 
conditions and can assert failure when an excessive time delay occurs. 

In developing Sci-Mate, directly coded routines have been found to be 
most effective for translating intermediate language data into multiple 
host system retrieval commands. Here, exceptions prevail over rules. Con- 
versely, directly coded translations from the intermediate language into a 
unified user presentation have been found to provide more effective direct 
control than a meta-representation could possibly provide. 

Retrieval languages come in families: Orbit and Elhill share a com- 
mon origin and form one family; Recon, Dialog, and ESA-IRS/Quest 
form another; BRS and DataStar (Switzerland) form a third; and there are 
others.^ In translating retrieval commands and responses, a matrix of 
functions by retrieval language family must be managed. 

The only regular form m potiern in the commands across the lan- 
guage families is the verb-argument arrangement. Even here the verb is 
often implied especially in the selection function. The syntax of each 
argument in each language family follows no pattern that can be observed 
across language families. Figure 1 gives an example of just those cells of the 



EMC 




DESIGN ISSUES 



103 



matrix that contain the Dialog, BRS, and NLM commands for display. 
Figure 2 shows the various ways in which Dialog, BRS, and NLM report 
sets formed. 



SYSTEM VERB ARGUMENT 

Dialog T or <set #> / <format code> /<item list or range> 
TYPE 

BRS ..P or <set #> <formai code> / DOC= <item list or 

..PRINT range> 

NLM PRT or SS <srt#> <format> SKIP <first record # - 1> 

PRINT 

Figure 1. Variations among host requirements for the Online Display 
Command. 



SYSTEM RESPONSE 

Dialog <set #> <postings count> <set formation expression> 
BRS RESULT <postings count> DOCUMENTS 
NLM SS <set #> PSTG <postings count> 

Figure 2. Variations among host responses to Set Formation Commands. 



Modularization remains an important principle and practice. In par- 
ticular, the user interface in Sci-Mate provides a separate module for each 
function. The host interface contains routines for all host functions in a 
single module with each host function managed by one or more proce- 
dures. Both the host and user interfaces draw data from and provide data to 
the Intermediate Language data structures where standardization ulti- 
mately takes place. 

Database Structure: Description and Requirements 

Online bibliographic databases show many parallel structural charac- 
teristics even across hosts. The relative simplicity and consistency of their 
structures make it possible to define and store fairly complete information 
about the structure, but of course not about the content, of most databases. 

The data records themselves are textual with variable length fields. All 
hosts have at least one format in which the fields aie labeled with prefix 
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tags of two to four characters. The labels are followed by one or more lines 
of textual data. The second and subsequent lines of data in a field are 
usually indented to show that they are part of the same field. 

Inverted indexes provide retrieval keys for the dau in most fields. A 
basic index containing terms from all fields is present on many hosts. 
Other fields have their own inverted indexes. Usually the contents of 
inverted indexes can be reviewed starting anywhere and continuing 
through the index in alphanumeric order. 

There are three ways in which term coordination is handled in search- 
ing. Some fields can be searched using only single word terms. Other fields 
precoordinate two or more terms into searchable phrases in addition to 
single word terms. Finally, proximity or adjacency searching, in which 
terms are postcoordinated at the time the command is constructed, is 
allowed in most fields. 

The user's primary requirement at the database level is information 
about the contents and structure of the database. Such information can 
guide the user in the selection of appropriate fields and the construction of 
search expressions. This information also specifies what tags to use to 
designate fields and the acceptable form fcr terms and expressions in each 
field. 

This information can be found in the database provider's documenta- 
tion and in the host system's factsheets. After locating the information, the 
conventional searcher must switch attention between the manual and the 
terminal screen. Users can also experiment while online to determine the 
syntax allowed in a field, but this can be time-consuming and therefore 
costly. Conventional online searching can be enhanced by making these 
tools immediately available on the terminal screen. 

A table of database information is used in the Sci-Ma;e Searcher to 
translate database syntax. Here the task consists of transforming one 
definition of the data structure into another. Both user and host require- 
ments for information about the database are represented in a single entry 
in the table. The table may be called a "knowledge base" since it represents 
expert knowledge about databases. 

In this knowledge base, the tags and usage of database elements 
required by the host are mapped to more complete names and descriptions 
for the same elements stored for users. Users face a mnemonic and encyclo- 
pedic problem in handling details of syntax at the database level. The 
knowledge base is intended to solve this problem with immediate informa- 
tion about the database. 

In addition to information about the fields, certain global informa- 
tion about the database must also be stored. This includes: the name of the 
database as it is to be presented to the user; the name of the database as it is 
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known to the host; and an indication of the host on which the database is 
found. 

The knowledge base stores information for both the user and the host 
system. For the user, the complete names of the fields are presented as part 
of a menu selection. After a selection is made, stored descriptions of the 
fields and its subfields are given as prompts. The user is told whether or not 
phrases are allowed in the field. 

For the host system, the value, placement, and punctuation of the field 
tag is extracted from the table and used in the construction of selection or 
browse commands. If phrases are allowed in the field, the appropriate 
adjacency or proximity symbols are supplied by the table. 

Ck>nsistent User PresenUtion for the Retrieval Process 

Three major language classes used in information retrieval have been 
identified: access protocols, retrieval commands and responses, and data- 
base structures. Numerous specific languages are found within each of 
these classes. For computer mediation, distinct translation methods were 
found to be most effective with each language class. How can these distinct 
translation methods be integrated for a consistent presentation to the user? 

First, what is meant by a consistent user preseritatibri? This at least 
means one in which the user is asked to become accustomed to a manage- 
able, well-defined, and easily learned set of conventions. It also means one 
in which the transitions from one set of options to another fit logically and 
naturally into the user's experience and expectations. It does not mean that 
some particular device or method made possible by the hardware and 
software is necessarily employed. 

Over the past few years there has been a trend away from explicit and 
toward implicit language interfaces for users.^ The oldest form of interac- 
tive interface and the one that traditional online information systems 
continue to take, is the host-prompt/user-command/host-response. This 
derives from the technology and economics of early timesharing. This 
form is one dimensional in that it looks like a simple dialogue alternating 
between a line from the user and one or more lines from the host system. 
Being mnemonic, it requires users to remember or quickly locate details 
about how the language can be used. 

As smart terminals and microcomputers become available, much 
higher display transfer rates are possible. For user interaction, options can 
be economically presented on a menu. The two-dimensional surface of the 
video display unit (VDU), fully refreshed in less than a second, presents the 
user with explicit options for direct selection. Thus interaction requires 
less lobe remembered and less to be entered, as a selection of an entry from a 
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menu is sufficient. The machine becomes proactive as it supplies direct 
and verbal options to its user. Much software for microcomputers, includ- 
ing Sci-Mate, uses this method for interacting with users. 

The newest class of interactive interfaces communicates in still more 
implicit language. New devices and methods give a spatial and environ- 
mental feeling to user interaction. Color graphics and icons communicate 
concepts without using words. Pop-up windows and pull-down menus 
present options instantly without obliterating the current context. Point- 
ing devices such as mice allow metaphorical navigation to every location 
on the screen. Foreground/background activities make users feel that they 
have to wait less time for the machine to complete its tasks. Memory- 
resident utilities allow machines to be used for diverse tasks; applications 
are availale at the touch of a key. 

No retrieval intermediary software developed to date has fully taken 
advantage of the new devices and techniques for interfacing with users. 
Almost all software in the mediation genre uses either a command- or 
menu-driven interface. However, consistency and ease of use are not pre- 
cluded by either of these methods. 

For access protocols, it is sufficient to provide users a method of 
selecting either the database or the host system. A further step is to auto- 
matically select a^database or set of databases depending on the general 
subject area being researched. This was done by In-Search and is being 
done by EasyNet. Such a selection can be performed adequately by either a 
command-driven or menu-driven interface. 

For retrieval commands and responses, users should be able to simply 
specify the function or operation to be performed. Better yet would be to 
provide users with recommendations for actions such as was done by the 
Individualized Instruction for Data Access (IIDA) project.^° In either case, a 
menu of possible or recommended functions continually available to the 
user for selection is helpful. 

Finally, for database structures represented by a knowledge base, a 
menu can effectively provide the field names for selection. The tables are 
further used to present prompts for both field and subfield data. 

Conclusions 

ITiree distinct classes of language have been identified in online 
bibliographic information retrieval. These classes are: access protocol, 
retrieval command/response, and database syntax. Computer interme- 
diary systems that perform language translation should recognize the 
special problems and requirements posed by each class and adapt media- 
tion to fit these problems. 



113 



DESIGN ISSUES 



107 



In developing the Sci-Mate Searcher, ic was found that a special- 
purpose procedural language was most effective for managing access 
protocol; direct computer routines drawing upon intermediate data struc- 
tures were most effective in handling retrieval commands/responses, and a 
knowledge base was most effective in dealing with database syntax. Despite 
the different languages and methods for their translation, the user ca .and 
should be presented with a consistent view of the whole mediated process. 
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User Friendly Future: Applications of 
New Information Technology 

What Is User Friendly? 

This paper considers the clinic theme, *'What Is User Friendly?" from a 
scientific and technical perspective. As Burch has observed in the introduc- 
tion to a bibliography on computer ergonomics and user friendly design, 
the term met friendly is an anomaly as a technical term: **Most words 
borrowed from science enter the popular language stream long after their 
associated discoveries have become history. The term 'user friendly' is an 
exception to this rule; it became popular long before a scientific basis for 
'user friendliness' had even been looked for."^ The current emphasis on 
user friendliness is both market- and technology-driven. There is an inter- 
est in making computers more useful tools for people who are not compu- 
ter specialists, thus expanding the potential user population; and there are 
new technological components that may be employed to make systems 
easier to use. 

Definitions proposed for user friendly/friendliness range from brief 
dictionary definitions (e.g., "a system with which relatively untrained 
users can interact easily") to lists of criteria (e.g., criteria for user friendli- 
ness proposed by Trenner and Buxton).^ Although a review of these 
definitions and criteria is one means of providing a context within which 
to view new technological developments, this paper instead begins with a 
historical perspective, describing selected proposals for user friendly sys- 
tems made over the past forty years. 

Technology Forecasting: Techno-poetic Fantasies 

In an essay introducing the technology section of The New Encyclo- 
paedia Britannica Propaedia, Lord Ritchie-Calder remarks that: "From 
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earliest lime and beginning with the simplest contrivances, every discovery 
and invention has depended on the fact that the human being is not only a 
perceptual but also a conceptual creature capable of observing, memoriz- 
ing, and juxtaposing images. He can make a mental design, a techno- 
poetic fantasy, even when the means of actually producing it are not 
available/'^ In the domain of information system design, there have been a 
number of such techno-poetic fantasies, designs for user friendly systems 
not realizable with the technology available at the time they were pro- 
posed. Rheingold has recently surveyed several of these proposals and the 
people behind them.^ Those described briefly in the following paragraphs 
originated with Bush, Licklider, Engelbart, Nelson, and Kay: memex, 
procognitive systems, the augmented knowledge workshop, hypertext, 
and dynabook. 

Vannevar Bush's article, ''As We May Think," in which he proposed 
memex and other devices, has frequently been cited in the library and 
information science literature since it first appeared in Atlantic Monthly in 
July 1945.^ Less well known is the condensed and illustrated version which 
appeared in Life 10 September 1945, including illustrations of future 
information technology such as memex7 Memex, as envisioned by Bush, is 
a mechanized private file and library. It is "a device in which an individual 
stores all his books, records, and communications, and which is mechan- 
ized so that it may be consulted with exceeding speed and f lexibi li ty. It is an 
enlarged intimate supplement to his memory."® Bush emphasized the 
value of organizing the contents using associative indexing, "whereby any 
item may be caused at will to select immediately and automatically 
another. This is the essential feature of the memex. The process of tying 
two items together is the important thing."^ 

In 1967 Bush had an opportunity to assess how much progress had 
been made toward the construction of memex.^^ He observed that: ''Great 
progress.. .has been made in the last twenty years on all the elements 
necessary. Storage has f>een reduced in size, access has become more rapid. 
Transistors, video tape, television, high-speed electric circuits, have revo- 
lutionized the conditions under which we approach the problem. How- 
ever, Bush was not optimistic that a personal machine woulri be affordable 
in a short time. He did not foresee the rapid progress in integrated circuit 
technology which led to personal computers in the 1970s. 

In 1965 J.C.R. Licklider published Libraries of the Future in which he 
described the likely characteristics of future computer-based information 
systems.^^ He coined the term procognitive systems to differentiate them 
from libraries, since the intent was that such systems "will extend farther 
into the process of generating, organizing, and using knowledge'* through 
interaction among men, computers, and the body of knowledge.^^ Criteria 
to be met by procognitive systems include: converse or negotiate with the 
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user while he formulates his requests and while responding to them; adjust 
itself to the level of sophistication of the individual user, providing terse 
streamlined modes for experienced users working in their fields of expert- 
ness, and functioning as a teaching machine to guide and improve the 
efforts of neophytes; provide the flexibility, legibility, and convenience of 
the printed page at input and output and, at the same time, the dynamic 
quality and immediate responsiveness of the oscilloscope screen and light 
pen." 

In 1982 Licklider had an opportunity to reflect on developments since 
1965.^^ Although he noted considerable advances in the technological 
infrastructure, such as increased storage capacity and the availability of 
networks fur digital transmission of information, he remarked that "the 
practically important application of information technology by libraries 
has not been, the past eighteen years, on any direct path to the procognitive 
system I was trying to describe in Libraries of the Future Nevertheless, 
he concludes by suggesting that, by the year 2000, librarians will have two 
important roles: (1) contributing to the work of the online intellectual 
community involved in generating and using the body of knowledge, and 
(2) organizing and maintaining the body of knowledge which will exist in 
electronic form. 

In 1 963 a series entitled **Vistas in Information Handling" began with 
a volume devoted to The Augmentation of Man's Intellect by Machine}*^ 
The lead paper in that volume, prepared by Douglas C. Engelbart, present- 
ed a conceptual framework for the augmentation of man's intellect.^® At 
the recent Association for Computing Machinery (ACM) Conference on 
the History of Personal Workstations, Engelbart reviewed research con- 
ducted in the intervening years toward realizing the "augmented knowl- 
edge workshop''— the place in which a person finds the data and tools with 
which he does his knowledge work, and through which he collaborates 
with similarly equipped workers. Engelbart feels that human knowledge 
work capability can be enhanced through properly harnessing this new 
technology. Although many of the technologies, both hardware and soft- 
ware, originally developed by Engelbart's group have now made their way 
into commercial products, he concluded his conference presentation on a 
somewhat pessimistic note: "I still don't see clear perceptions about what 
we humans can gain in new capabilities, or about how this may come 
about. There are constant, echoing statements about how fast and smart 
the computers are going to be, but not about how the enhanced computer 
capabilities will be harnessed into the daily thinking and working life of 
our creative knowledge workers."^ 

At a colloquium on information retrieval held in 1966, Theodor H. 
Nelson argued that access to information may not be best accomplished 
either by indexing techniques (document retrieval) or queriable informa- 
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lion networks (content retrieval).^^ As an alternative, he suggested that 
digital text storage and display make possible the creation of hypertext oi 
nonlinear text systems. Hypertext is the combination of natural language 
text with che computer's capacities for interactive, branching, or dynamic 
display; it "may differ from ordinary text in its sequencing (it may bianch 
into trees and networks), its organization (it n)ay havr multiple levels of 
summary and detail), its mode of presentation (it may contain moving or 
manipulable illustrations, moving or flashing typography), and so on."^^ 
Nelson has been pursuing development of the technology required to 
support this concept, as reported in his book Literary Machines,^ 

The final techno-p>oetic fantasy noted here is the dynabook, proposed 
by researchers at the Xerox Palo Alto Research Center.^^ The dynabook 
would be "a personal dynamic medium the size of a notebook.. .which 
could be owned by everyone and could have the power to handle virtually 
all of its owner's information-related needs."^ Alan Kay and Adele Gold- 

26 

berg describe what such a device would be: 

Imagine having your own self-contained knowledge manipulator in a 
portable package the size and shape of an ordinary notebook. Suppose it 
had enough power to ouirace your senses of sight and hearing, enough 
capacity to store for later retrieval thousands of page-equivalents of 
reference materials, poems, letters, recipes, records, drawings, anima* 
tions, musical scores, waveforms, dynamic simulations, and anything 
else you would like to remember and change. 

Although none of these authors used the term user friendly in charac- 
terizing the products of their imagination which are now at least partially 
realizable with available technology, a technologically based definition of 
the concept user friendly should include such visions of the future. In each 
case ease of interaction was taken as a given; instead the focus was on means 
of creating, organizing, searching, and using the contents of the knowl- 
edge base. 



Technology Transfer: Information Technology 

Before turning to a consideration of the rechnological components 
which will form the basis of user friendly systems in the future, it is 
appropriate to note the plethora of periodicals which have emerged in an 
effort to speed the transfer of technology into the library context. Titles 
include Information Technology and Libraries, Program: News of Com- 
puters in Libraries, Small Computers in Libraries, Microcomputers in 
Information Management, Library Software Review, The Electronic 
Library, Electronic Publishing Review, Online, Online Review, Database, 
Library Hi Tech, Library Hi Tech News, Library Technology Reports, 
Information Retrieval and Library Automation, Advanced Technology/ 
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Libraries, and Information Today. Periodicals such as Library Journal 
and Wilson Library Bulletin also now have regular columns devoted to 
library uses of technology. Although sources in the computer science and 
engineering literature must be consulted to follow current research in 
information technology, possibilities for application are documented in a 
reasonably timely manner in the periodicals published for a library and 
information science audience. Given ;he rapidity with which new develop- 
ments occur, the next section simply highlights some of the technological 
components currently available for design and construction of more user 
friendly systems. 

Technological Components: Hardware and Software 

Developments in hardware contrib'Ue to user friendliness by making 
many alternatives first feasible and then economical . Because users of most 
systems can be expected to be a heterogeneous group, choices in hardware 
allow alternative modes of access to be implemented for a given system. For 
example, microcomputers can be substituted for dumb terminals now that 
information processing technology has become relatively inexpensive. 
This enables the system to present alternative interfaces, such as one that is 
menu-driven rather than command-driven. Lx)cal processing also offers 
the possibility of implementing 2?;teways to simplify access to multiple 
systems, masking differences which users may find hard to remember. 

Telecommunications contributes io ease of interaction through the 
transmission speed which can be supported. New types of links using fiber 
optics can support higher speed and larger bandwidth so that more data 
can be transmitted at a faster rate. In addition there are now possibilities for 
integrating voice, text, image, and data communications. 

New forms of storage media make possible local, self-contained infor- 
mation systems as an alternative to interactive access of remote databases. 
In particular the optical disks, such as CD-ROMs, offer large capacity 
storage fcr digital data as well as visual images. Because cost to use such 
systems is no longer a function of connect time to a remote computer, new 
types of interaction which would be too costly in systems charging for use 
by the minute are possible. 

Input/output devices have the most direct impact on perceived user 
friendliness. Input is no longer confined to the QWERTY keyboard which 
anyone but the touch typist may find cumbersome to use. Touching (using 
touch screens) and pointing (using devices such as the mouse) can be used 
to indicate choices in menu-based systems. Output can use printers, plot- 
ters, and display screens with possibilities for different fonts, colors, win- 
dows, and graphics. Although not yet as common, limited voice input and 
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speech output allow the use of sound rather than tactile and visual means 
of recording and reporting. 

Software is of course required to make all these hardware components 
operate. In judging user friendliness, one is concerned with what Shackel 
has termed the "cognitive and software interface."^ Components include 
languages (e.g*, use of command languages v. natural language), informa- 
tion organization, display format and layout, dialogue structure and 
design, error message design, and advanced interfaces (e.g., intelligent 
systems adaptive to the user). Tools are beginning to be available with 
which to design and build many of these components as identified, for 
example, in Bundy's Catalogue of Artificial Intelligence Tools.^ 

Given this wide range of technological components, the challenge is 
to combine elements to create more user friendly systems. As Smith notes, 
there are significant differences between designing hardware and software 
for the user interface.^ Formal standards may be applicable to hardware 
design, but flexible design guidelines rather than standards are applicable 
to software design. For example, Rubinstein and Hersh present a well- 
developed set of guidelines for human-oriented design.^ In general, more 
guideline information is available relating to the physical interface than to 
the cognitive interface.^* 

Technological Integration: Personal Workstations 

Development of {personal wcT*^istations represents the computing 
environment which will form the basis for user friendly systems in the 
fixture. The transition has been characterized by Perlis and White: 
'Twenty five years ago computing was stationary, ponderous and central- 
ized. Its dominant role wa^ to serve the critical needs and purposes of 
organizations and the sciences. Today matters are very different. Computa- 
tion is personal, ubiquitous and expansive. Power is being supplied at and 
to the fingertips of the individual." ^The workstation concept issustained 
by four technologies: dedicated microprocessors, local srea networks, local 
databases, and gateways to mainframes.^ Various input/output devices 
are provided, depending on the tasks which the workstation is designed to 
support. The workstation is used to carry out both generic activities (e.g., 
calculation, word processing, mail) and profession-related activities (e.g., 
scientific or engineering analyses) with appropriate software support. 

These computing and communication systems are already appearing 
in organizations of which libraries are a part, such as universities. At 
Carnegie- Mel Ion University, for example, a system named ANDREW is 
being developed with personal computers, raster graphics, high band- 
width communications, and time-sharing file systems as components.^ 
The designers anticipate that ANDREW will affect university education in 
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four main areas: compuier-assisced instruction, creation and use of new 
tools, communication, and information access. With respect to informa- 
tion access, the designers comment that "a mark of tomorrow's profes- 
sional will be the ability to navigate in large information repositories" 
including the library's database, worldwide databases, and databases devel- 
oped within the university.^ Some predictions of how such systems will be 
used have already appeared. For example. Spinrad offers what he terms 
"vignettes" describing how a typical student, professor, and administrator 
would function in an electronic university.^ and Lancaster describes how 
the scientist could use an electronic infonnation system to create, tnmsmit. 
and receive information.^^ Some of the "techno-poetic fantasies" cited 
earlier also suggest ways in which a personal workstation could be used. 

Technology Assessment: An Appropriate Skepticism 

To provide a balanced discussion of technology in support of user 
friendliness, it is necessary to interject what John Shelton Liwrencc has 
termed "appropriate skepticism." In discussing tlie use of computers for 
word processing, he notes that: ''Computer users often allow their exhila- 
ration with hardware and productivity to displace the criticil attention 
they formerly gave to their manually produced material... .Tlie physical 
appearance of the computer's output is seductive in this regard; because it 
prints absurdity as beautifully as the most carefully wrought expression, 
one is tempted not to look beneath its surface."^ Asimilar danger exists in 
the context of user friendly catalogs and other information systems. Prob- 
lems may arise if the following factors are not taken into consideration. 

Comprehensibility. In a piece entitled "Black Box Blues." Dixon 
remarked that "tlie real danger of the microelectronic era is posed by what 
was called, even in the days of macroelectronics. tlie black box mentality: 
passive acceptance of the idea that moreandnioreareasof life will betaken 
over by little black boxes whose mysterious workings are beyond our 
comprehension."^ Tlie algorithms followed by computers are not neces- 
sarily comprehensible to users. Yet by knowing the basis for system deci- 
sions, the user can more appropriately accept, reject, or modify them. 
Designers must determine the extent to which computer processes should 
be made explicit rather than hidden. 

Scope of the system. A great deal of effort can be expended to no 
ptirpose if the user seeks information which in fact is not contained in the 
system. In order to use the system intelligently, a user needs to understand 
its scope— i.e.. the broad class of questions to which the system is designed 
to respond. 

Limitations of the system. T!ie attempt to make human-computer 
dialogues more like human-human dialogues may lead to an overly 
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anthropomorphic interpretation of the computer .system by users. Without 
a way to probe the limits of capabilities of a human-like system, the user is 
likely to attribute more power lo it than it actually has. 

Source of information. When information is sought f om printed 
sources or from other people, the inquirer has some basis forjudging the 
authoritativencss of the material or the response. By masking aspects of the 
search process fro..* the user— such as database selection— and by present- 
ing isolated responses— whether citations or facts— the inquirer has no 
basis for judging the domain covered or the reliability of the response. 

Mastery of the system. In a piece entitled ''Can Online Gitalogs Be 
Too Easy?" Arret pointsout that user easy isnotuserfriendly if progressive 
learning and system mastery are siicrificed.^^ If the^rc is no way for the user 
to advance beyond the simple searches supported by the user friendly 
interface, then there is no way that the full ix>wer of the system can l>e 
exploited. 

In the spirit of technology assessment, a discussion of the technology 
supporting user iriendly systems must acknowledge these potential piob- 
lems. Given the current limitations of user friendly systems, users must 
develop an appropriate skepticism and designers must exploreapproachcs 
to deal with issues such as those enumerateu earlier. 

Halfway Technology Versus High Technology 

In an essay on the techrology of medicine written in 1971, lliomas 
introduced a distinction between what he termed '*halfway technology** 
and "high technology."^* He explained that halfway technology is chanic- 
tcrized by things done afitr ihe fact in efforts to compensate for the 
incapacitating effects of certain diseases. He noted that the real high 
lechnolo:;)' of medicinf comes as the result of a genuine understanding of 
disease mechanisms, allowing prevention and/or effective treatment. 
Interpreting Ci"iCse concepts in the content of information technology, one 
cou!d describe efforts to design more user friendly interfaces to existing 
systems as halfway technology, trying to improve access to systems not 
initially designed from the perspective of user needs. To achieve high 
technology, research is required to understand the needs of the user far 
better than is the case toAiy. Tliis theme is echoed by Qiapiuiis who talks of 
''taming and civilizing computers" by discovering enough about human 
behavior to design computer systems for enhancement and enrichment/*^ 
and by Birnbaun:' ivho notes that the ''domestication of microelectronics" 
will only be achieved by developing computer technology in the context of 
what the user wants to do.^^ At present iht hardware is far ahead of theory 
and research in user customization. Fortunately, there is an increasing 
amount of interest and research activity in this area, drawing on behavioral 
scientists as well as computer scientists. 
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User Friendly Future 

This discussion began with the observation that user friendly is an 
anomaly as a technical term. Nick^rson has suggested a simple alternative 
which may prove more satisfying:^* 

Whether "friendliness" is the right concept is perhaps a matter of taste. 
"Usability" strikes me as ihe more appropriate and completely adequate 
concept; in imputing the quality of friendliness to a machine, one is 
diluting the meaning of one of the most pleasant of words. 

And Burch in turn offers a measure of usability:*^ 

System transparency is the ultimate, ideal measure of computer usabil- 
ity. It is achieved when a system's overall design is so compatible with the 
way the user thinks, talks, listens, remembers, perceives, processes infor- 
mation, asks questions, makes decisions, and solves problems, that the f 
system itself requires none of the user's attention and, in effect, becomes 
invisible. It happens in the same way that a reader curled up with a good 
book becomes unaware of the paper, the typeface, the book itself, or the 
room around him. 

The current concern for user friendliness can be viewed as an attempt to 
cope with halfway technology. Future attention to usability and usefulness 
may lead the way toward high technology. 
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